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Erythropoietin Muteins With Enhanced Activity 

This invention was made with Government Support under grant nos. 
R01-HU2949 and R01-GM39900 sponsored by the National Institute of 
Health. The Government has certain rights in the invention. 

5 Cross-Reference to Related Applications 

This application is a continuation-in-part of U.S. Patent Application 
Serial No. 08/049,802 filed April 21, 1993 which is incorporated herein by 
reference. 

Background 

10 This invention relates to erythropoietin in general and more particularly 

to modified erythropoietin proteins (erythropoietin muteins) having improved 
biological activity. 

Erythropoietin (hereafter referred to as Epo) is a glycoprotein hormone 
which, in its mature form in humans, is 166 amino acids long with a 

15 molecular weight of 34 to 38 kD (Jacobs et ai. Nature 313: 806-809 (1985). 
Epo occurs in a, j3, and asialo forms which differ slightly in their 
carbohydrate composition and biological activity (Dordal etal.. Endocrinology 
776: 2293-2299 (1985)). 

In terms of biological function, Epo has been recognized as the 

20 hematopoietic cytokine that regulates the process of red blood cell production, 
known as erythropoiesis. Erythropoiesis is a controlled physiological process 
which normally produces red blood cells in numbers which do not impede 
blood flow, but which are sufficient for oxygen transport. 

The binding of Epo to its cognate receptor (D'Andrea, A.D,, et al, 

25 Cell 57:277-285 (1989)) on erythroid precursor cells in the bone marrow 
results in salvaging these cells from programmed cell death, known as 
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apoptosis (Koury, M.J., et aL. Science 2-«:378-381 (1990)), allowing them 
to proliferate and differentiate into circulating erythrocytes (red blood cells). 
Epo thus facilitates the formation of erythrocytes from erythroid precursor 
cells. 

5 The level of ^o in circulation normally controls the rate of red blood 

cell production in the body. Typically. Epo is present at a low plasma 
concentration sufficient to maintain a steady state concentration of red blood- 
cells by stimulating formation of just enough red blood cells to replace those 
lost through normal processes, such as aging. However, when an increase in 
10 the production of red blood cells is needed, the amount of Epo in circulation 
is increased. This red blood cell deficit may be caused, for example, by 
anemia, the general loss of blood through hemorrhage, over-exposure to 
radiation, prolonged unconsciousness, or exposure to high altitudes where 
oxygen intake is reduced. 
15 In mammals, Epo is produced in the fetal liver and adult kidney, 

circulates in the bloodstream, and binds to receptors on committed progenitor 
cells in the bone marrow and other hematopoietic tissues, resulting in 
proliferation and terminal maturation of erythroid cells (Jelkmann, W., 
Physiol. Rev, 72:449 (1992)). The expression of both mRNA and 
20 protein, is markedly increased by hypoxia, owing to a 3' enhancer and to 
highly conserved elements in the promoter region (Semenza, GX. etal,, 
Proc. Natl Acad. Sci. USA 58:5680-1684 (1988); Beck. 1. etaL, J. Biol 
Oiem. 266:15563-15566 (1991); Pugh, C.W. etal., Proc. Natl Acad. Sci. 
USA 88:10553-10557; Blanchard, K.L. etal. Mol Cell Biol 72:5373-5385 
25 (1992)). This elegant servomechanism enables Epo to regulate the red cell 
mass of man and other animals. 

Genes encoding Epo from a number of species have been studied. 
Thus far, the Epo genes of man (Jacobs, K. et al. Nature 373:806-10 (1985); 
Lin. F.-K. etal, Proc. Natl Acad. ScL USA 82:7580-7584 (1985)). a 
30 monkey {Macacafascicularis) (Lin, F.-K. et al , Gene 44:201-209 (1986)) and 
a rodent, the mouse (McDonald, J.D. et al, Mol & Cell Biol 6:842-848 
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(1986); Shoemaker, C.B. e/ fl/., MoL & Cell Biol. 6:849-858 (1986)) have 
been cloned, sequenced, and expressed. There is a high degree of sequence 
homology in the coding region of the mature secreted proteins from these 
species. 

5 In particular, the gene encoding human Epo has been cloned and used 

to produce recombinant Epo in cell culture. Recombinantly produced human 
Epo has general utility as a substitute for Epo isolated from natural sources. • 
In addition, recombinant Epo has facilitated the production of large amounts 
of Epo free from deleterious substances associated with naturally derived Epo, 

10 which has allowed for greater availability and utility of this protein. 

Recombinantly produced Epo has proven especially useful for the 
treatment of patients suffering from impaired red blood cell production 
(Physicians Desk Reference (PDR), 1993 edition, pp 602-605). Recombinant 
Epo has proven effective in treating anemia associated with chronic renal 

15 failure and HIV-infected individuals suffering from lowered endogenous ^ 
levels related to therapy with Zidovudine (AZT) (See PDR, 1993 edition, at 
page 602). 

Modifications of the Epo protein which would improve its utility as a 
tool for diagnosis or treatment of blood disorders are certainly desirable. In 

20 particular, modified forms of Epo exhibiting enhanced biological activity 
would be more effective and efficient tiian native Bfo in tiie tiier^y setting 
when it is necessary to administer Epo to the patient, enabling administration 
less frequentiy and/or at a lower dose. Administration of reduced amounts of 
^o would also presumably reduce tiie risk of adverse effects associated witf) 

25 ^0 treatment, such as hypertension, seizures, headaches, etc. (See PDR, 
1993 edition, at pp. 603-6(H). 

Unfortunately, available information regarding the structure of the Epo 
enzyme and its function does not permit the accurate prediction of 
modifications which would result in such improvements. In fact, the highly 

30 conserved nature of the amino acid sequence of Epo indicates that this protein 
is, for tiie most part, resistant to change and tiiat most modifications would be 
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expected to have deleterious effects on Epo and its activity (See U.S. Patent 
No. 4,835,260 by Shoemaker). 

Some speculation does, however, exist regarding the possibility of 
creating variants of Epo with altered characteristics that would be preferable 
5 to the wild type protein for particular purposes. A number of modifications 
of Epo have been proposed, including deletions, additions, and substitution of 
existing amino acids. These modifications have been prqwsed as a means of • 
achieving a variety of effects, including separation of various Epo functions 
on individual protein fragments, improving the efficacy of recombinant l^o 
10 production, improving in vivo stability, etc. Of particular relevance to die 
present invention are tiiose modifications proposed to improve tiie biological 
activity (i.e. ability to stimulate red blood cell production) of Epo. 

U.S. Patent No. 4,703,008 by Lin, F-K. (hereinafter referred to as 
"die '008 patent") speculates about a wide variety of modifications of Epo, 
15 including addition, deletion, and substitution analogs of Epo. The *008 patent 
does not indicate that any of tiie suggested modifications would increase 
biological activity per 5e, altiiough it is stated that deletion of glycosylation 
sites might increase the activity of Epo produced in yeast (See tiie '008 patent 
. at column 37. lines 25-28). Also, die '008 patent speculates tiiat Epo analogs 
20 which have one or more tyrosine residues replaced witii phenylalanine may 
exhibit an increased or decreased recq)tor binding affinity. 

Australian Patent Application No. AU-A-59145/90 by Fibi, M et aL 
(herein referred to as "Fibi") also discusses a number of modified Epo proteins 
(^0 muteins). Fibi generally speculates about die alteration of amino acids 
25 10-55. 70-85, and 130-166 of Epo. In particular, additions of positively 
charged basic amino acids in die carboxyl terminal region are purponed to 
increase die biological activity of Epo. 

U.S. Patent No. 4,835,260 by Shoemaker, C. B. discusses modified 
Epo proteins widi amino acid substitutions of tfie mediionine at position 54 and 
30 asparagine at position 38. Such Epo muteins are tiiought to have improved 
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Stability but are not proposed to exhibit any increase in biological activity 
relative to wild type Epo. 

Beyond the few proposed modifications that have been speculated to 
enhance the biological activity of 1^, the art provides no further guidance 
S regarding how Epo can be modified to increase its biological activity. This 
lack of guidance indicates that, based upon the present state of knowledge, 
other modifications of Epo which may increase biological activity cannot be 
predicted and may not even exist. 

The present invention represents the inventors' achievement in 
10 overcoming the lack of guidance and unpredictability regarding modifications 
of the Epo protein which increase its biological activity. 

Summary Of The Invention 

It is therefore an object of the present invention to provide Epo muteins 
with enhanced biological activity. 
15 It is a further object of the present invention to provide recombinant 

DNA encoding Epo muteins and methods of producing biologically active ^o 
muteins in a host cell culture using such recombinant DNA. 

It is a further object of the present invention to provide methods of 
treating blood disorders involving abnormally low red blood cell populations 
20 using Epo muteins at dosages lower than that required for unmodified Epo to 
achieve the same affect. 

It is a further object of the present invention to provide a method of 
obtaining additional Epo muteins with enhanced biological activity. 
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Brief Description Of The Drawings 

Figure 1. Oligonucleotide Primer Sequences. Oligonucleotide 
primers corresponding to Epo DNA sequences are shown. The localization 
of the primer coincides with previously published nucleotide sequences for the 

5 human (Lin. F.-K. et al., Proc. Natl Acad, ScL USA 82:7580-7584 (1985)) 
and murine (McDonald, J.D. et aL, Mol. & Cell Biol. 6:842-848 (1986)) Epo 
genes. EX2R and EX5 are not completely conserved between human and 
mouse (respectively 93% and 95% identity). An equal amount of each 
possible nucleotide was incorporated during die corresponding cycles of those 

10 primer syntheses. EX2R and NCOl are reverse primers and their sequences 
represent the anusense DNA strand. 

Figure 2. PCR Strategy used for the cloning of maminalian 
cDNAs containing the complete coding sequence of the mature 
Erythropoietin Protein. Genomic amplification and sequencing of exonic 

15 fragments, localized upstream and downstream from the nucleotide portion 
coding for the mature protein, allowed the design of species specific primers 
(SP). Utilization of tiiose SP primers and/or of, sequences diat are 100% 
conserved between man and mouse 5' ATG and 3' NCOl primers (Figure 1) 
on cDNA templates prepared from kidney of uninduced or hypoxia-induced 

20 animals, allows the amplification of a large variety of mammalian Epo clones. 
Sense (-♦) and antisense (♦-) primers are represented by the arrows. Dashed 
boxes correspond to die coding part (propeptide and mature protein) of the 
five Epo exons. Gray boxes represent die 5' and 3' untranslated regions. 
Figure 3. Comparison of IVl/E^R sequences from various 

25 mammals. The numbering corresponds to die published human genomic 
sequence (Lin, F.-K. etal, Proc. Natl. Acad. ScL USA 52:7580-7584 
(1985)). The reported human sequence was obtained from the amplification 
of purified genomic DNA from Hep3B cells and agreed wiUi die sequence 
previously reported. The mouse sequence is from McDonald. J.D. et al, 

30 Mol. & Cell Biol. 6:842-848 (1986). The horse sequence was obtained from 
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kidney-extracted genomic DNA. All the other sequences were established 
from PGR amplification of genomic DNAs purified from several mammalian 
renal-derived cell lines. The consensus sequence indicates the positions where 
a unique nucleotide was found in all the reported species. The boundary 
5 between intron 1 and exon 2 is represented by the ascending arrows. The 
locations of the two PGR primers, IVI and EX2R are shown by the dashed 
arrows. 

Figure 4. Alignment of the nucleotide sequences of mammalian 
Epo cDNAs. The mouse sequence corresponds to the one previously reported 

10 (McDonald, J.D. et al.^MoL & CellBioL 6:842-848 (1986)). PCR-produced 
human sequence was obtained from hypoxia-induced Hep3B cell line and is in 
total agreement with the previously published sequence (Jacobs, K. etaL, 
Nature Ji3:806-810 (1985); Lin, F.-K. etai, Proc. Natl. Acad. Sci. USA 
82:7580-7584 (1985)). Monkey (Macaca mulatta), rat. sheep, pig and cat 

15 were amplified from kidney-purified cDNAs. Amplifications of the human, 
monkey, rat and sheep were realized using the ATG (residues 1 to 20) and 
NGOl (residues 723-742) primers. The line under residues 41 to 56 
represents the specific SPl primer used for the cloning of the pig and cat 
sequences. For these two mammals, 5' sequences for SPl are derived from 

20 the data presented in Figure 3. Arrows indicate the start and the end of the 
coding sequence (propeptide and mature protein). 

Figure 5, Predicted Amino Acid Sequences of Epo Propeptide. 
The amino acid sequences are derived from data obtained from both genomic 
IV1/EX2R and cDNA amplifications. The numbering corresponds to the 

25 human sequence. Ala 1 is the first amino acid of the human mature protein. 
The ascending arrows show the site of the cleavage by the signal peptidase, 
as determined for the human and cynomolgus monkey proteins. 

Figure 6, Alignment of the Primary Structures of mature 
mammalian Epo Proteins. The human. Macaca fascicularis and mouse 

30 amino acid sequences were previously reported (Lin, F.-K. et al, Proc. Natl. 
Acad. ScL USA 82:7580-7584 (1985); Lin. F.-K. etaL, Gene 4^:201-209 



wo 94/24160 



PCT/US94/04361 



-8- 

(1986);McDonald,J.D. e/fl/.,Afc?/. 4Ce//B/o/. 6:842^^ The 
residue numbers correspond to the 166 aa of the mature human protein. Plain 
boxes indicate the positions of the predicted four a-helices in the human 
sequence (See Example II). Dashed boxes show the N- and 0-glycosylation 

5 sites. Limits between exons are indicated by the vertical lines. 

Figure 6A. Schematic representation of the mammal 
erythropoietin. The invariant amino acids among the eight sequences shown 
in Figure 6 are represented by the black boxes. The localization of each of 
the four a-helices is underlined. The three glycosylation sites are shown as 

10 diamonds. The main disulfide bridge is indicated by the heavy line connecting 
noncontiguous sequences. The small arrows under the sequence indicate the 
short SH-bridge, missing in the rodents. 

Figures 7-78. Riylogenetic lineages derived from analyses of 
mammalian Epo cDNA sequences encoding 
15 the full length mature protems from seven 

species. 

Figure 7. Strength of groups in the maximum parsimony tree 
found on examining all 945 unrooted trees formed by seven terminal taxa. 
This tree of lowest length required 374 base substitutions. Each link between 

20 ancestral nodes has a circled number; this strength of grouping number is the 
minimum number of substitutions that must be added to the length of the 
maximum parsimony tree to find a tree dmt breaks down the barrier (moves 
one pr more sequences) between the two groups separated by the interior link. 
F^ure 7 A. The phylogenetic tree derived from the maximum 

25 parsimony reconstruction. On the basis of other molecular evidence 
involving comparative amino acid sequence data from monotremes. 
marsupials, and many eutherian species (Czelusniak, J. et al.. Ueth. EnzymoL 
783:601-615 (1990); Czelusniak, J. etaL, In "Current Mammology", 
Volume 2, 3rd ed., H. H. Ganoways, pp. 545-572, Plenum Publ. Corp. 

30 (1990)) the root of this Epo phylogenetic tree is placed on tiie interior link 
separating die artiodactyl group from the primate, rodent, cat group. The 
numbers on the intemodal links represent die numbers of base pairs by which 
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the nodal ancestral and descendant sequences differ. MYA = millions of 
years ago, as inferred from paleontological views of eutherian phylogeny. 

Figure 7B. Parsimony reconstruction on that portion of the 
cDNA sequences that are codons for amino acids. The numbers shown as 
5 a fraction on each link are. in the numerator, the number of amino acid 
changing base replacements and, in the denominator, the number of silent base 
replacements. The computer algorithm that carried out this calculation is 
described in Czelusniak, J. et fli.. Nature 298:297-300 (1982) (see the legend 
for Figure 2). 

10 Figures SSA. Phylogenetic Uneages derived from analyses of 

the mammalian E^o intron 1-exon 2 
sequences. 

Figure 8. Strength of groups in the maximum parsimony tree 
found on examining all 135,135 trees formed by nme temunal taxa. As 

15 the two feloid sequences (those of cat and lion) are nearly identical, they were 
treated as a single taxon in this strength of grouping analysis. The maximum 
parsimony tree for the segment (about 285 bp) of intron 1 and exon 2 from ten 
species required 361 base substitutions. The circled numbers are the strength 
of grouping numbers (as defined in the Figure 7 legend). 

20 Figure 8A. The near most parsimonious tree that groups dog 

withfeloids. This tree required 365 base substitutions. The numbers on links 
represent numbers of base pairs by which the nodal ancestral and descendant 
sequences differ. 

Figures 9-93. Model of the Three-Dimensional Structure of 
25 Erythropoietin. 

Figure 9. Ribbon diagram ofEpo tertiary structure. The four a 
helices are labeled A to D; loops between helices are apprq)riately named. 
Disulfide bridges are shown and N- and 0-glycosylaiion sites are indicated 
respectively by dark and dotted segments. 
30 Figure 9A, Schematic representation of Epo's primary stmcture 

depicting predicted up-up-down-down orientation of the four antiparallel a 
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helices (boxes with arrowhead). This folding pattern is strongly suggested by 
the large size of the two interconnecting loops AB and CD. The limits of each 
helix were drawn according to Table II of Example I. A predicted short 
region of /3-sheet is delineated by the dashed rectangle. The N-glycosylation 
5 sites are represented by the dotted diamonds, the 0-glycosylation site by the 
dashed oval. The locations of the two disulfide bridges are shown as solid 
lines. 

Figure 9B. Cross-section of the Epo molecule at the level of the 
four a heUces. The helical wheel projections are viewed from the NH,-end 

10 of each helix. The hydrophobic residues, localized inside the globular 
structure, are indicated by filled circles. The charged and neutral residues 
(open and gray circles respecUvely) are exposed at the surface of the molecule. 

Figure 10. Immunopredpitations of wild type Epo and the 
A140-144 mutein. Cos? cells were transfected with pSG5, pSG5-Epo/wt or 

15 pSG5-Epo/A140-144. After three days, the cells were metabolically labeled 
with pS]-methionine and ["S]-cysteine. Immunoprecipitations of cellular 
extracts and supcmatants were performed with our polyclonal antibody, raised 
in rabbit against the native human Epo. The immunoprecipitates were 
analyzed by SDS-polyacrylamide gel electrophoresis. Lanes 2 to 4 correspond 

20 to cellular extracts, and lanes 5 to 7 correspond to culture supematants, from 
Cos7 transformed with the following: plasmid without insert (lanes 2 and 5), 
wild type Epo (lanes 3 and 6). and A140-144 Oanes 4 and 7). Lane 1 
represents the protein molecular weight standard. The two arrows show the 
normal secretion of the wild type Epo (35-37 kD) and the cytoplasmic 

25 retention of the mutein A140= 144 ( - 28 kD). 
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Figures ll-llB. Interconnecting Loop AB. 
Figure IL Schematic representation of the loop AB showing the 
localization of mutebis with various deletions and amino acid 
replacements. The dashed arrows point to the positions of the serine 
5 substitutions (in A48-52). The two N-glycosylation sites are rq)rcsented by 
the gray diamonds. The small Cys29=Cys33 disulfide bridge is indicated. 

Figure 11 A. Amount and biological activities of secreted muteins. 
The upper bar graphs show the relative sccreuon of wild type and loop AB 
muteins as determined by radioimmunoassay. The lowest bar graphs display 
10 the calculated specific activity (ratio bioassay/RIA) for each mutein, in 
comparison with the value obtained for the wild type Epo (ratio = 100%). 

Figure IIB. HCDS7 cell proliferation as a function of increastag 
concentration of wild type and serine-substituted Epo muteins. HCD57 
cells (10* per ml) were cultured for three days in a 96 well microtiter plate 
15 with media containing increasing concentrations of secreted proteins 
(mU =milli-units of Epo, a relative mass measurement). The line graphs show 
the cellular growth as measured by ^H-thymidine uptake for cells cultured with 
wild type Epo (O), Epo mutein F48S (O), Epo mutein Y49S (X), Epo mutein 
A50S ( ♦ ), Epo mutein W51S (□), and Epo mutein K52S (a). The number 
20 of viable cells was also measured with the MTT colorimetric assay and gave 
similar curves. Proliferation experiments using the human UT-7/Epo cell line 
(Komatsu. N.. et aL, Cancer Res. 57:341-348 (1991)) and the Krystal assay 
(Krystal, G., Exp. HematoL 77:649-60 (1983)) produced identical results. 
Figures 12'12A. Interconnecting Loop CD. 
25 Figure 12. Schematic representation of the loop CD showing the 

location of three deletion mutems: A105-109, Alll-119, A122-126, and the 
insertion of seven residues after Lysll6 (myc epitope) . The 0-glycosylation 
site is indicated by the dashed oval. 

Figure 12A. Secretion and biological activities of the mutems 
30 located ta loop CD. The two bar graphs were created as described in 
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Figure 1 lA. The two mutants Al 1 1-1 19 and 1 16/myc were normally secreted 
and had full biological activities. 

Figures J3-13A. In Vitro Translation of the Epo WUd Type. 

Figure 13. Analysis of the ^'S-labeled translation products by 
5 SDS-PAGE. One-step transcription/translation reactions were performed in 
the SP6-TnT rabbit reticulocyte lysate system. 1/30 of each reaction was 
resolved on a 15% polyacrylamide gel. Lane Mow Mr standard from 
Amersham; lane 2-in vitro reaction without added plasmid; lanes 3 and 4 are 
translation products obtained after incubation of 1 iig of circular p64T-Epo, 
10 in the presence or absence of canine pancreatic microsomal membranes, 
respectively. 

Figure 13A. Binding of the in vitro translated Epo wUd type onto 
Epo Receptor-GTS-agarose beads. 6x\ff q)m (counts per minute) of 
purified "S-labeled erythropoietin products, processed with microsomes (+) 
15 or not (-) were incubated in the presence of the extra-cytoplasmic domain of 
the Epo receptor (EREJ, following the protocol described by Harris, K.W., 
etal.. J. Biol. am. 267:15205-15209 (1992). Identical binding 
demonstrated that the conservation of the propeptide did not impair flie 
hormone/receptor interaction. 
20 Figures 14-14A. COOH-end of Epo. 

• mgure 14. Schematic representation of the analyzed muteins, 
corresponding to die deletion of die four last amino acids A163-166 and die 
replacements of die residues 162 to 166 by a KDEL or poly (His) sequences. 
Figure 14A. Relative secretion of these muteins. The bioactivities 
25 in tiie supematants (Krystal. G., Exp. Hematol. 17:649-60 (1983)) and die cell 
extracts (Komatsu, N.. et a/.. Cancer Res. 57:341-348 (1991)) of transformed 
Cos7 cells were measured by in yntro proliferation assay using HCD57. More 
KDEL mutant remained in die cytosol of die Cos7, when compared wiUi die 
wild type Epo and A163-166 or poly (His) muteins. However, all die 
30 analyzed muteins had die same ^ific activity as diat of die wild type. 
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Figure 15. Bacterial Expression of WUd Type Epo. 
Figure 15 Panel A. Diagram of the fusion protein. An NH,- 
terminal 22 amino acid long peptide, containing a 10 histidine stretch, was 
fused to the mature erythropoietin sequence. Factor Xa cleavage allowed the 

5 recovery of the mature Epo with only two extra residues at its amino terminus. 

Figure 15 Panel B. IPTG induction of the fusion protein. 
Transformed E. coli BL21 (DE3) cultures (ODeoo=0.6) were grown In 
presence of 1 mM IPTG. Aliquots were collected at 0 (lane 2), 1 (lane 3), 2 
(lane 4) and 3 hours (lane 5) and analyzed on a 15% SDS-polyacrylamide gel, 

10 stained with Coomassie Brilliant Blue. High level production of the fusion 
protein was rapidly obtained. Lane 6 corresponds to an aliquot from 
transformed bacteria grown for 3 hours in a medium without IPTG. Lane 1 
is a low molecular weight standard. 

Figure 15 Panel C. Purification of the fusion protein. After 3 

15 hours of IPTG induction, the produced (His),o-Epo was solubilized in 6 M 
guanidine-HCl and purified on a nickel affinity resin by increasing imidazole 
concentrations following the pET-His system protocol (Novagen). Samples of 
the column eluants were analyzed by SDS-PAGE and stained with Coomassie 
Brilliant Blue. Lane 1-elution by 20 mM imidazole; lane 2-elution by 100 

20 mM imidazole, releasing the fusion protein; lane 3-chelation of the nickel by 
a 100 mM EDTA wash; lane 4-molecular weight standard. 

Rgure 15 Panel D. Detection of the £• coli recombinant Epo on a 
Western blot. Solubilized proteins were separated on ai 15% SDS- 
polyacrylamide gel, transferred to a nitrocellulose membrane and probed with 

25 a 1/2000 dilution of our native wild type polyclonal antibody, as described 
under Materials and Methods. Lane 1-analysis after oxidative reduction; 
lane 2-after dialysis against the factor Xa buffer; and lane 3-after factor Xa 
cleavage. 

Figure 16. Relationship Between Production of Mutems and 
30 Proposed Secondary Structure. This bar graph shows the amount of 
secreted proteins in the supematants of transiently expressed Epo mutants, as 
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detected by radioimmunoassay. The muteins were aligned over a schematic 
representation of the native Epo molecule. Each deletion is shown as a 
stippled bar. the width of which is proportional to the number of residues 
deleted. The four a helices are represented by the black rectangles. The two 
5 disulfide bridges are indicated. These mutagenesis results are in good 
agreement with our prqwsed four a-helical model of Epo. 

Figure 17. Biological Activity of Helfac A Muteins. The top panel 
shows a helical wheel projection of Epo helix A with arrows indicating the 
posiUons where modifications were made to create muteins. The lower panel 
10 is a dose response curve representing the biological activity of wild type Epo 
(B) and helix A muteins E13A(*). R14A (D). E18A (0), K20A (*). and 
E21 A (□) over a range of dosages (X-axis; mU =milli-units of Epo. a relative 
mass measurement) as determined by tiie relative incorporation of pH]- 
tiiymidine measured in counts per minute (cpm) (Y-axis) into Epo dependem 

15 HCD57 cells. 

Figure 18. Bblogkal AcHvUy of Helix D Muteins. The top panel 

shows a helical wheel projection of Epo helix D with arrows indicating the 

positions where modifications were made to create muteins. The lower panel 

is a dose response curve representing die biological activity over a range of 

20 dosages (X-axis) of wild type Epo (□) and helix D muteins D136A (♦), 

R139A (B). K140A (O) and 143A (x) as determined by the relative 

incorporation of pH]-thymidine (Y-axis) into Epo dependem HCD57 cells. 

Hgure 19. Biological Activity of Muteins Adjacent to HcUx D. 

The top panel shows the region adjacent to the C-terminal boundary of Epo 

25 helix D witii arrows indicating the positions where modifications were made 
to create muteins. The lower panel is a dose response curve representing the 
biological activity over a range of dosages (X-axis) of wild type Epo (Q) and 
muteins adjacem to helix D K152A (♦). K154A (B). and Y156A (0) as 
determined by the relative incorporation of pH]-thymidine (Y-axis) into Epo 

30 dependent HCD57 cells. 
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Figure 20. Comparison of Muteins With Increased Biological 
Activity. A dose response curve representing the biological activity of wild 
type Epo (-3e), and muteins R143A (□) and K154A (♦) with increased 
biological activity relative to wild type Epo is shown over a range of dosages 

5 (X-axis) as determined by the relative incoiporation of pH]-thymidine (Y-axis) 
into Epo dependent HCD57 cells, n. 

Figure 21. Comparison of Muteins With Increased Biological 
Activity in Non-malignant Cells. A dose response curve representing the 
biological activity of wild type Epo (-ae), and muteins R143A (□) and K154A 

10 ( ♦ ) with increased biological activity relative to wild type Epo, is shown over 
a range of dosages (X-axis) as determined by the relative incorporation of 
[^H]-thymidine (Y-axis) into murine spleen cells. 

Figure 22. Comparison of Muteins With Increased Biological 
Activity in a Human Cell Line. A dose response curve representing the 

15 biological activity of wild type Epo (x), and muteins R143A (□), and K154A 
(♦) with increased biological activity (relative to wild type Epo) is shown 
over a range of dosages (X-axis) as determined by the relative incorporation 
of pH].thymidine (Y-axis) into human Epo-dependent UT-7/Epo cell line 
. (Komatsu. N. et aL, Cancer Res. 51: 341-348 (1991). 

20 Figure 23. Comparison of Muteins with Increased Biological 

Activity in a Human Cell line. A dose response curve representing the 
biological activity of wild-type Epo (□) and mutein N147A ( ♦ ) with increased 
biological activity (relative to wild-type Epo) is shown over a range of dosages 
(mU) (X-axis) as determined by die relative incorporation of pH]-tiiymidine 

25 (Y-axis) into human Epo-dependent UT-7/^ cell line. 

Figure 24. Comparison of Muteins with Increased Biological 
Activity in a Human Cell line. A dose response curve representing tiie 
biological activity of wild-type Epo (□) and mutein N147A ( ♦ ) witii increased 
biological activity (relative to wild-type Epo) is shown over a range of dosages 
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(mU) (X-axis) as determined by the relative incorporation of pH]-thyniidine 
(Y-axis) into human Epo^Iependent UT-7/^ cell line. 

Figure 25. Comparison of Mutelns with Increased Biological 
Activity in a Human Cell lane. A dose response curve representing the 
5 biological activity of vnld-type Epo (Q) and mutein N147A ( ♦ .B) widi 
increased biological activity (relative to wild-type B^) is shown over a range 
of dosages (mU) (X-axis) as determined by the relative incorporation of ['H]- 
tiiymidine (Y-axis) into human Epo^lcpendcnt UT-7/Epo cell line. 
Biological Activities of Helix A Muteins. 
10 Figure 26. Comparison of Mutdns with Increased Biological 

Activity in a Human Cell line. A dose response curve representing die 
biological activity of wild-type Epo (a) and mutein K20A witii increased 
biological activity relative to the wild type Epo, is shown over a range of 
dosages (X-axis) as determined by tiie relative incorporation of pH]-tiiymidine 
15 (Y-axis) into UT-7 Epo cells. 

Figure 27. Comparison of Muteins with Increased Biological 
Activity In a Human Cell line. A dose response curve representing die 
biological activity of wild-type Epo (Q) and mutein Y49S witii increased 
biological activity relative to tiie wild type Epo, is shown over a range of 
20 dosages (X-axis) as determined by the relative incorporation of pH]-tiiymidine 
(Y-axis) into UT-7 Epo cells. 
Biological Activities of HeUx B Muteins. 

Figure 28. Comparison of Muteins with Increased Biological 
Activity In a Human Cell line. A dose response curve rqjresenting the 
25 biological activity of wild-type Epo (0) and mutein A73G witii increased 
biological activity relative to die wild type Epo, is shown over a range of 
dosages (X-axis) as determined by die relative incorporation of pH]-tiiymidine 
(Y-axis) into UT-7 Epo cells. 
Biological Activities of Helbc D Muteins. 
30 Figure 29. Comparison of Muteins with Increased Biological 

Activity in a Human Cell line. A dose response curve rq)resenting die 
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biological activity of wild-type Epo (t3) and mutein K140A with increased 
biological activity relative to the wild type Epo, is shown over a range of 
dosages (X-axis) as determined by the relative incorporation of pH]-thymidine 
(Y-axis) into UT-7 Epo cells. 
5 Figure 30a. Comparison of Muteins with Increased Biological 

Activity in a Human Cell line. A dose response curve representing the 
biological activity of wild-type Epo (£3) and mutein R143A with increased 
biological activity relative to the wild type Epo, is shown over a range of 
dosages (X-axis) as determined by the relative incorporation of pH]-thymidine 
10 (Y-axis) into UT-7 Epo cells. 

Figure 30b. Comparison of Muteins with Increased Biological 
Activity in a Mouse Cell Line. A dose response curve representing the 
biological activity of wild-type Epo (Q) and mutein R143A with increased 
biological activity relative to the wild type Epo, is shown over a range of 
15 dosages (X-axis) as determined by the relative incorporation of pH]-thymidine 
(Y-axis) into HCD 57 cells. 

Figure 3L Comparison of Muteins with Increased Biological 
Activity in a Human Cell Line. A dose response curve representing the 
biological activity of wild-type Epo (□) and mutein S146A with increased 
20 biological activity relative to die wild type l^o, is shown over a range of 
dosages (X-axis) as determined by the relative incorporation of pHJ-tiiymidine 
(Y-axis) into UT-7 Epo cells. 

Figure 32. Comparison of Muteins with Increased Biological 
Activity in a Human Cell Line. A dose response curve representing the 
25 biological activity of wild-type Epo (Q) and mutein N147A with increased 
biological activity relative to tiie wild type Epo. is shown over a range of 
dosages (X-axis) as determined by tiie relative incorporation of pH]-thymidine 
(Y-axis) into UT-7 Bpo cells. 

Figure 33a. Comparison of Muteins with Increased Biological 
30 Activity in a Human Cell Line. A dose response curve representing the 
biological activity of wild-type Epo (13) and mutein K154A with increased 
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biologlcal activity relative to the wild type Epo. is shown over a range of 
dosages (X-axis) as determined by the relative incorporation of pH]-thymidine 

(Y-axis) into UT-7 Epo cells. 

Figure 33b. Comparison of Muteins with Increased Biological 
5 Activity in a Mouse Cell Line. A dose response curve representing the 
biological activity of wild-type Epo (□) and mutein K154A with increased 
biological activity relative to the wild type Epo. is shown over a range of 
dosages (X-axis) as determined by the relative incorporation of pH]-thymidine 
(Y-axis) into HCD 57 cells. 
10 Figure 33c. Comparison of Muteins with Increased Biological 

Activity in Non-Malignant Mouse Cells. A dose response curve representing 
die biological activity of wild-type Epo (0) and mutein K154A widi increased 
biological activity relative to the wild type Epo, is shown over a range of 
dosages (X-axis) as determined by the relative incorporation of pH]-thymidine 
15 (Y-axis) into primary murine spleen cells. 

Figure 34. Bioactivity of Epo Replacement Muteins. The letters in 
die column on the left designate predicted helices (A. B. C, D) or interhelical 
loops (A-B, C-D). The 3'D designation (3' of D) in the table represents 
amino acids 153-166 which arc at tiie carboxy-cnd of die D-helix. The amino 
20 acid of wild type human Epo, its position from die N-terminus, and its 
leplacemem according to die single amino acid letter code, arc listed under die 
mutein column. c.g. S9A, represents a mutation from die wild-type human 
Epo at the amino acid serine of position number 9 to alanine. The diree 
bioassay columns (human UT7 cell line, murine HCD57 cell line, and primary 
25 mouse spleen crytiiroid cells) show specific bioactivity of each mutein, 
expressed as a percentage of wrild type human Epo bioactivity, widi die 
background COS supernatant alone subtracted from die value. The mean and 
standard deviation of die bioassay values are listed for n^3 determinations. 
The numbers widiin die brackets indicate die number of separate COS cell 
30 transfections over (/) die number of separate bioassays diat were performed. 
NS: not secreted from COS 7 cells. 
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Definitions 

In order to provide a clearer and more consistent understanding of the 
specification and claims, including the scope to be given such terms, the 
following definitions are provided. 

5 Administration. The term "administration" is meant to include 

introduction of the ^o muteins of the invention into an animal or human by 
any 8q)propriate means known to the medical or veterinary art, including, but 
not limited to, injection, oral, enteral, and parenteral (e.g., intravenous) 
administration. 

10 Amino Acid Codes. The most common amino acids and their codes are 

described in the following table: 
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Table 1 
Amino add names and codes 



Amino acid 


oingie icucr 
code 


Three letter 
code 


/\laIlHlC 


A 


Ala 


ArglUluC 


R 


Arg 


/\SpaniU awlU 


D 


Asp 


ASpdraginc 


N 


Asn 


uysteme 


c 


Cys 


UiUCHmiC aClu 


E 


Glu 


uiUvaminc 


0 


Gin 


uiycme 


G 


Gly 


tiisuainc 


H 


His 


isoicucine 


1 


lie 


Leucine 


L 


Leu 


Lysine 


K 


Lys 


Methionine 


M 


Met 


Phenylalanine 


F 


Phe 


Proline 


P 


Pro 


Serine 


S 


Ser 


Threonine 


T 


Thr 


Tryptophane 


W 


Tip 


Tyrosine 


Y 


Tyr 


Valine 


V 


Val 



Animal The term "animar is meant to include all animals whose 
25 production of red blood cells (erythrocytes) is dependent upon, or stimulated 
by, erythropoietin (^o). Foremost among such animals are humans; 
however, the invention is not intended to be so limiting, it being within the 
contemplation of the present invention to treat any and all animals which may 
experience the beneficial effect of the Epo muteins of the invention. 
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Efficacbus Amount. An "efficacious amount*' of an Epo muiein of the 
invention is an amount of such Epo mutein that is sufficient to bring about a 
desired result, particularly the stimulation of red blood cell production, 
especially upon administration to an animal or human. 
5 Erythrocyte. The term "erythrocyte" is intended to refer to a red blood 

cell. 

Erythroid Precursor Cells, By "erythroid precursor cells" is intended 
cells with a capability to proliferate and differentiate into erythrocytes that is 
dependent upon exposure or contact with Epo. 

10 Erythropoietin (Epo). The term ''erythropoietin", or "Epo" is 

primarily intended to refer to the mature form of this hormone. The terms 
"wild type Epo" and "native Epo" are used interchangeably to refer to Epo in 
its naturally occurring, unmodified form. Unless otherwise indicated, amino 
acid positions on the Epo protein referred to herein are made with reference 

15 to the mature. 166 amino acid human Epo sequence shown in Figure 6 and 
corresponding sequences from other species. 

Host Cell, By "host" or "host cell" is intended the cell in which a gene 
encoding an Epo mutein of the invention is incorporated and expressed. An 
Epo mutein gene of the invention may be introduced into a host cell as part 

20 of a vector by transformation. 

. Mutein. By "mutein" is meant a mutant or modified protein with one 
or more modifications to its native amino acid sequence in the form of amino 
acid additions, deletions, or substitutions. An erythropoietin mutein refers to 
a modified erythropoietin protein. 

25 Pharmaceutically Acceptable Vehicle. The term "pharmaceutically 

acceptable vehicle" is intended to include solvents, carriers, diluents, and tiie 
like, which are utilized as additives to preparations of the Epo muteins of the 
invention so as to provide a carrier or adjuvant for the administration of such 
Epo muteins. 

30 Transformation. By "transformation" is intended Uie act of causing a 

host cell to contain a desired nucleic acid molecule, including either a native 
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Epo gene or a gene encoding an Epo mutein of the invention, not originally 
part of that cell using methods known in the art. 

TransfecUon. By "transfection" is intended the introduction of a DNA 
or RNA vector carrying a desired nucleic acid molecule, including either a 
5 native Epo gene or a gene encoding an Epo mutein of the invention, into a 
host cell. 

Vector. By "vector" is intended a DNA element used as a vehicle for • 
cloning or expressing a desired gene, such as an Epo mutein gene of the 
invention, in a host. 

10 Detailed Description of the Preferred Embodiments 

This invention is drawn to modifications of Epo which enhance its 
biological activity. Due to their enhanced activity, the Epo muteins of the 
invention provide a preferable alternative to native Epo where it is used to 
stimulate red blood cell production (e.g. treatment of blood disorders, cell 

15 culture, etc.). 

The present invention teaches a structural model of the Epo protein 
useful for identifying functional regions which may be modified to enhance 
biological activity. According to the invention, Epo is predicted to have a 
four anti-parallel amphipathic a-helical bundle structure. Based upon this 

20 predicted structure, functionally important regions predicted to reside on the 
external surfaces of the helices (designated A-D, see Figures 9A-9B) are 
identified. In particular, the external surfaces of helices A and D, predicted 
to be involved in Epo receptor binding, are identified. According to the 
invention, these functionally important regions serve as targets for 

25 modifications which may enhance biological activity. Increased activity 
relative to the wild-type Epo can be found with amino acid substitutions in the 
A,B,C or D helix. 

According to one aspect of tiie invention, Epo muteins are provided in 
which one or more of the amino acids witfiin the predicted external surfaces 
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of helices A, B. C. D and 3' of D, or within the region immediately adjacent 
to the C-terminal end of helix D, have been replaced. In particular, amino 
acid substitutions at positions 20, 49, 73, 140, 143, 146, 147 and 154 of wild 
type Epo have been replaced. The inventors have discovered that substitution 
5 of the amino acids normally occurring at these positions results in Epo muteins 
with significantly higher biological activity than wild type ^o proteins. 

According to the invention, the amino acids at position 20 (typically • 
alanine), 49 (typically serine), 73 (typically glycine), 140 (typically lysine), 
position 143 (typically arginine), position 146 (typically serine), position 147 
10 (typically asparagine), and 154 (typically either lysine or threonine) of wild 
type Epo are preferably replaced with an alanine residue. In addition to 
alanine other amino acids with a chemical structure and properties similar to 
alanine, such as serine and threonine, are contemplated by the invention as 
suitable substitutes for achieving Epo muteins of the invention with enhanced 
15 biological activity. 

In another aspect of the invention, a portion of the predicted interloop 
region between helices A and B extending from amino acid 48-52 is identified 
as important to Epo function. According to the invention, this functionally 
important region serves as another target for modifications which may enhance 
20 biological activity. The inventors have discovered that substitution of die 
amino acid at position 49 results in Epo muteins witii significantly higher 
biological activity than wild type Epo proteins. 

According to die invention, tiie amino acid at position 49 (typically 
tyrosine) of wild type Epo is preferably replaced widi a serine residue. Otiier 
25 amino acids with a chemical structure and properties similar to serine, such 
as alanine and tiireonine, are contemplated as suitable substitutes for achieving 
Epo muteins of the invention witii enhanced biological activity. 

The modifications taught by tiie present invention occur in regions that 
are highly conserved among Epo proteins from evolutionarily divergent species 
30 (see Example 1 and Figure 6 in particular). These modifications arc thus 
expected to be broadly applicable to Epo proteins which arc substantially 
similar to die human Epo protein in die regions where die modifications occur, 
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including, but not limited to, Epo proteins from monkey, mouse, rat, sheep, 
pig, cat, and dog. 

Biological activity of Epo muieins can be determined by assaying the 
ability of these proteins to stimulate the proliferation of Epo responsive cells 

5 such as murine spleen cells (Krystal. G., Exp. HematoL 7i:649-60 (1983); 
Goldberg. M.A., etaL, Proc. Natl. Acad. ScL USA «:7972.7976 (1987)), 
murine Epo responsive murine erythroleukemia cells (Hankins, W.D., et al, • 
Blood jamz (1987)), and human Epo-dependent UT-7/Epo cells (Komatsu, 
N., et ai. Cancer Res. 57:341-348 (1991)). 

10 When assaying for biological activity, it is important to recognize that, 

as taught by the present invention, high levels of biologically active Epo can 
cause terminal maturation and cessation of division of Epo responsive cells. 
Because of this possibly confounding phenomenon, it is important to determine 
biological activity based on a full dose response curve over a broad range of 

15 dosages which includes non-saturating levels of the particular Epo mutein 
being assayed. 

Although the mechanism by which the modifications taught by the 
invention increase biological activity is not completely understood, one 
possibility is tiiat these modifications enhance receptor binding affinity. This 
20 possibility is tfiought to apply particulariy to tiie modifications at positions 20, 
49, 73, 140, 143, 146, 147, and 154. which lie within a region predicted to 
be involved in Epo receptor binding according to die teachings of tiie present 
invention. 

Genes Encoding Epo muteins (Epo mutein genes) 

25 In addition to die Epo muteins themselves, genes encoding tiiese 

muteins are also contemplated by tiie present invention. Genes encoding tiie 
Epo muteins of die invention may be made by modification of die native Epo 
gene using standard mediods well known to diose of skill in die art. 

The native Epo gene may be obtained using a variety of standard 

30 techniques. For example, a DNA molecule corresponding to die known 
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sequence of the Epo gene from a number of species may be artificially 
synthesized (See Example I and Figure 4). The Epo gene may also be isolated 
from a cDNA or genomic library using nucleic acid probes based on available 
Epo DNA sequence information and standard hybridization techniques as 
5 taught in U.S. Pat. No. 4,703,008. issued Oct. 27. 1987, which is herein 
incorporated by reference (See also Jacobs et al.. Nature 313: 806-810 
(1985)). Alternatively, amplification of Epo gene sequences by polymerase • 
chain reaction (PGR) may be accomplished using primers based on available 
sequence data as taught in Example I. 
10 Once obtained, the wild type Epo coding sequence can be altered to 

encode an Epo mutein of the invention using standard oligonucleotide-directed 
in vitro mutagenesis techniques (See ZoUer, M. and Smith, M., Methods in 
Enzymology 100: 468-500 (1983); Dente, L. etai, NucL Adds Res. 11: 1645 
(1983); Kunkel. T. A„ et al, Meth. EnzymoL 75*367-382 (1987); Vandeyar. 
15 M. et al. Gene 65: 129 (1988)). These techniques generally involve 
hybridization of an oligonucleotide carrying the desired alteration to a single 
stranded target DNA. This is followed by complementary strand synthesis of 
the target DNA which incorporates the oligonucleotide carrying the desired 
. alteration. The complementary strand containing the desired alteration is then 
20 propagated in a bacterial host cell. Specific 2q)plication of this type of 
technique to modify the Epo gene is described in Examples II and III and in 
Australian Patent Publication No. AU-A-59145/90 by Fibi et al. and in U.S. 
Pat. No. 4,835,260 by Shoemaker, botii of which are herein incorporated by 
reference. 

25 Expression of Epo mutein Genes 

Genes encoding Epo muteins of the invention can be introduced and 
expressed in a selected host cell system using conventional materials and 
techniques. DNA elements such as promoters, enhancers, polyadenylation 
sites, transcription termination signals, and the like should be associated with 
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the Epo mutein coding sequence so as to promote and control expression of 
the Epo mutein. The specific regulatory elements used will depend upon the 
host cell system selected for expression, whether secretion of the protein is 
desired, and other considerations readily apparent to one of skill in die art. 
5 Various vectors may be employed as vehicles for tiie introduction and 

expression of Epo mutein genes in a host cell. Such vectors useful in die 
different host cell types are well known and include, for example, the 
mammalian expression vectors pSG5 (Stratagene). p-RKl (Genetics Institute). 
p.SVK3 (Pharmacia). p-EUK-Cl (Clontech). pCDM (Invitrogen), pc DNAI 
10 (Invitrogen). and die bacterial expression vectors pFLAG-1 OBI), all pET 
system plasmids (Novagen). pTrcHis (Invitrogen). die pGEX series 
(Pharmacia), and pKK 233-2 (Clontech). These vectors may be maintained 
as episomes in the host cell or tiiey may facilitate integration of the Epo 
mutein gene into die host cell genome, or bodi. Vectors may also include 
15 otiier useful features, such as genes which allow for die selection or detection 
of cells in which tiiey have been successfully introduced. 

Host cells suitable for expression of die Epo muteins of die invention 
include, but are not necessarily limited to. E. coli, yeast, insect cells, plant 
cells, and a variety of mammalian cell types, including in particular Chinese 
20 Hamster Ovary (CHO) cells. Cos? cells. Cosl cells, baby hamster kidney 
cell?, and CVl cells. Bpo mutein genes can be introduced into a host cell 
using standard transformation or transfection techniques. 

Host cells which provide for glycosylation are preferred for production 
of Epo muteins to be used /« vivo since die carbohydrate structure of Epo is 
25 important for optimal biological activity in mvo (See Dordal. M.S. et al., 
Endocrinology 116(6): 2293-2299 (1985)). 

The Epo muteins of die invention may be recovered and purified from 
host cells in which diey are expressed using known metiiods such as 
immunoaffinity chromatography widi antibodies to human Epo. 
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Therapeutic Use ofEpo muteins 
of the Invention 

The Epo muteins of the invention are suitable for therapeutic use in 
animals, particularly humans, and may be used in the same manner as wild 

5 type Epo, except that lower dosages will be required to achieve the same level 
of biological activity. 

Administration of the Epo muteins of the invention may be 
accomplished by any of the methods known to the skilled artisan, provided 
that the method effectively places the Epo muiein in the appropriate 

10 environment to exhibit its activity (i.e. in contact with erythroid precursor 
cells). A preferred method is by parenteral routes, including intravenous and 
subcutaneous administration. 

Administration will ordinarily include an efficacious amount of the Epo 
mutein supplied in a pharmaceutically acceptable vehicle. The amount of Epo 

15 mutein which is efficacious will be determined by a trained professional 
depending upon a number of considerations, including the condition being 
treated and its severity, the sex and body weight of the subject being treated, 
the method of administration, and the presence of compositions in the 
pharmaceutically acceptable vehicle which affea Epo activity. 

20 . In any event, the efficacious amount of a ^ mutein as taught by the 
invention will be significantly less than the corresponding amount of 
unmodified Epo needed to achieve die same degree of efficacy. The 
difference bet>yeen the efficacious amount of the Epo mutein and the 
analogous amount of unmodified Epo will correspond with the increased 

25 biological activity exhibited by die Epo mutein. It is contemplated that an 
efficacious amount of an Epo mutein of the invention will be as low as 10-fold 
less tiian native Epo, or as low as about 5-10 units/kg body weight for treating 
chronic renal failure patients or about 10-30 units/kg body weight for treating 
HIV-infected patients witii serum Epo levels less than or equal to 500 mil/ml 
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(milliunits/milliliter) who are receiving a dose of AZT equal to, or less than. 
4200 mg/week. 

Since the Epo muteins of the invention can be used effectively at lower 
dosages relative to wild type Epo, their use may reduce potential adverse 

5 effects associated with administration of Epo such as exacerbation of 
hypertension, seizures, headaches, tachycardia, nausea, clotted vascular 
access, shortness of breath, hyperkalemia, and dianhea (see PDR, 1993 
edition, at pp. 603-604). 

Further description of methods and materials useful in the recombinant 

10 production of the Epo muteins of the invention and their use is provided in the 
teachings of U.S. Patent No. 4,835.260 by Shoemaker and U.S. Patem 
No. 4,703,008 by Lin, both of which are herein incorporated by reference in 
their entirety. 

The invention may be better understood and appreciated by reference 
15 to the following examples. These examples are provided merely for 
illustrative purposes and should not be considered in any vray as limiting the 
scope of tije claimed invention. 

EXAMPLE I 

Erythnpoietui Structure-Funedon Relationships: 
20 High Degree of Sequence Homoiogy Among Mammals 

Summary 

In order to investigate structure-fiinction relationships of erythropoietin 
(Epo), we have obtained cDNA sequences tiiat encode tiie mature Epo protein 
of a variety of mammals. A first set of primers, corresponding to conserved 
25 nucleotide sequences between mouse and human DNAs, allowed us to amplify 
by PCR intron 1-exon 2 fragments from genomic DNA of hamster, cat, lion, 
dog, horse, sheep, dolphin and pig. Sequencing of tiiese fragments permitted 
tiie design of a second generation of ^ies-specific primers. RNA was 
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prepared from anemic kidneys and reverse-transcribed. Using our battery of 
species-specific 5' primers, we were able to successfully PGR amplify ^o 
cDNA from Rhesus monkey, rat. sheep, cat and pig. Deduced amino acid 
sequences of mature Epo proteins from these animals, in combination with 

5 known sequences for human. Cynomolgus monkey and mouse, showed a high 
degree of homology, which explains the biological and immunological cross 
reactivity that has been observed in a number- of q)ecies. Human Epo is 91 % 
identical to monkey Epo, 85% to cat Epo and 80-82% to pig, sheep, mouse 
and rat Epos. There was full conservation of a) the disulfide bridge linking 

10 the NHj and COOH termini; b) N-glycosylation sites; and c) predicted 
amphipathic a-helices. In conurast, the short disulfide bridge (C29/C33 in 
human) is not invariant. Cys33 was replaced by a Pro in rodents. Most of 
the amino acid replacements were conservative. The C-terminal part of the 
loop between the C and D helices showed the most variation, with several 

15 amino acid substitutions, deletions and/or insertions. Calculations of 
maximum parsimony for intron 1/exon 2 sequences as well as coding 
sequences enabled the construction of cladograms that are in good agreement 
with known phylogenetic relationships. 

INTRODUCTION ^ 

20 A large number of eariy physiologic studies have established extensive, 

though not complete, biological cross-reactivity between Epos of man and a 
number of other mammals including mouse, rat, sheq) and rabbit (Jelkmann, 
W., Physiol Rev. 72:449 (1992); Garcia, J.F. et al, Proc. Soc. Exp. Biol 
Med, man-m (1962); Zanjani, E.D, etal. Proc. Soc. Exp, Biol 

25 Med. i3J:1095-1098 (1969); Biljanovic-Paunovic, L. etal. Period. Biol 
90:421-427 (1988)). In contrast, non-mammalian vertebrates (amphibians 
(Rosse. W.F.. et al, Bbod 22:66-72 (1963); Pinder, A. et al, J. Exp. Biol 
i(?J:205-213 (1983)), birds (Rosse, W. etal. Blood 27:654-661 (1966); 
Black, CP. etal, Respirat. Physiology 59:217-239 (1980)), fish (Zanjani, 
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E.D., et al. Science 35:573 (1969)) have erythropoietic hormones that fail to 
cross-react with mammalian erythroid cells, and vice-versa (Rosse, W.F., 
et al.. Blood 22:66-12 (1963); Pinder. A. et al., J. Exp. Biol. 705:205-213 
(1983); Rosse. W. et al. Blood 27.65A-66\ (1966)). 
5 Thus far, the Epo genes of man (Jacobs. K. et at.. Nature 373:806-10 

(1985) ; Lin. F.-K. et al., Proc. Natl. Acad. Sd. USA 82:7580-7584 (1985)), 
a monkey {Macacafascicularis) (Lin, F.-K. et al.. Gene ^:201-209 (1986)) 
and a rodent, the mouse (McDonald, J.D. et al., Mol. & Cell Biol. 6:842-848 

(1986) : Shoemaker, C.B. et al., Mol. & Cell Biol. 6:849-858 (1986)) have 
10 been cloned, sequenced, and expressed. In view of the marked cross- 
reactivity betvreen mammalian Epos, it is not surprising that there is a high 
degree of sequence homology in the coding region of the nmture secreted 
proteins. In keeping with their dose phylogcnetic relationship, human and 
monkey Epos are 94% and 91% identical in nucleotide and amino acid 

15 sequence respectively. In contrast, human and mouse Epos are 76% identical 
in nucleotide sequence and 80% identical in amino acid sequence. 

Direct information on the three-dimensional structure of Epo is not yet 
available. Insights into structure-function relationships of Epo can be gained 
from the analyses of a more complete set of animal sequences. Such 

20 information could be useful for sequence-based computer modeling of three- 
dimensional structure. Moreover, a larger data base would permit the 
identification of highly conserved domains that are likely to be crucial to 
folding and/or biological function. Finally, comparative amino acid and 
nucleotide sequence information on Epo provides additional data for 

25 investigating phylogeneuc relationships. 

In tiiis Example, die use of PCR based techniques for obtaining tiie full 
coding sequences of Epos of Rhesus monkey, rat, sheep, pig and cat, as well 
as partial sequences from four other species is demonstrated. 
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Materials and Methods 

Animal Samples. Human hepatoma Hep3B cell line and multiple 
kidney-derived cell lines from hamster (BHK), sheep (MDOK), pig (LLC- 
PKl), dog (MDCK), cat (CRF-K), lion (PALl-K) and spotted dolphin (SPl-K) 

5 were obtained through the American Type Culture Collection (American Type 
Culture Collection, 12301 Pariclawn Drive, Rockville, MD 20852). 
Monolayer cells were grown in 100 x 20 mm tissue culture dishes using 
recommended media and maintained in a humidified 5% C02/95% air 
incubator at 37°C. In some experiments, cells were made hypoxic by 

10 overnight incubation in 1% O., 5% CO^, 94% Nj at 37°C. 

Poly A+ RNA prepared from Rhesus monkey kidney was purchased 
from Clontech (Palo Alto, CA). Kidneys from anemic cat and horse were 
obtained from veterinarians at Tufts School of Veterinary Medicine and the 
University of Nebraska, respectively. Anemia was induced by three 

15 consecutive daily intraperitoneal injections of phenylhydrazine (60 mg/kg body 
weight) into male Sprague-Dawley rats (SD-strain) and by repeated bleeding 
of the sheep and the pig. Kidneys were aseptically removed after induction 
and stored immediately in liquid nitrogen. 

A 4 kb genomic human Epo clone (g Epo4) was provided by Genetics 

20 Institute (Boston, MA). 

DNA and RNA Preparations. DN As from cell lines or homogenized 
kidneys were extracted using pancreatic RNAase/SDS/proteinase K following 
a procedure modified from Blin and Stafford (Blin. N. et aL, Nucleic Acids 
Res. 5:2303-2308 (1976)). 

25 Kidneys were homogenized in 4 M guanidine isothiocyanate (10 ml/g 

of frozen organ), containing 0.1 M beta-mercaptoethanol and 10% N-lauryl- 
sarcosine. Total RNAs were isolated by centrifugation over 5.7 M CsCl 
(Chirgwin, J.M. etaL, Biochemistry 18:5294 (1979)). After ethanol 
precipitation, the samples were resuspended in diethyl pyrocarbonate-treated 

30 water and stored at -70**C. Confluent monolayer cells were washed twice with 
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sterile phosphate buffered saline and directly lysed in the 4 M guanidine 
isothiocyanate solution. Total RNAs were isolated as described above for the 
kidneys. 

RNA samples were first converted into single strand cDNA. 2 to 4 /tg 
5 of total RNA fiom kidney or 500 ng to 1 of total RNA from cultured cells 
were denatured at 68»C in the presence of 2 fig of oligo dT,,,,. Reverse 
transcription was carried out in a 20 ^1 final volume, containing 50 mM Tris- 
HCl pH 8.3. 6 mM MgCU. 40 mM KCl. 10 mM DTT. 500 f»M dNTPs. 20 
u RNAsin (Promega. Madison. Wl) and 20 u of avian myeloblastosis virus 
10 reverse transcriptase (AMV-SuperRT. Molecular Genetic Resources. Tampa. 
FL). The reaction vras allowed to proceed at 37»C for 20 minutes, then 
at 42»C for one hour. Inactivation of the enzyme was performed at lOO^C 
for 10 minutes. cDNA samples were stored at -TO'C. 

Polymerase Chain Reaction. 20 to 100 ng of genomic-DNA or 0.5 
15 to 1 /tl of the RT-reaction were used as templates for amplification. 

Standard PCR reactions were performed in a 100 n\ volume 
containing 50 mM Tris-HCl pH 8.3. 50 mM KCl. 1.5 mM MgCl,. 0.01 % w/v 
gelatin. 200 iM dNTPs. 30 pM of each sense (5') and antisense (3') primers 
and 0.5 units of Taq DNA polymerase (Perkin Elmer Cetus. Emeryville. CA). 
20 After an initial denaturation step for 5 minutes at routinely 35 cycles 
of PCR were performed on the DNA Thermal Cycler (Perkin Elmer Cetus). 
The denaturation step of each cycle was carried out at 94»C for 1 min. For 
each sample and/or the primer pair used, annealing temperatures were 
optimized (from 45''C to eO^C) for a 2.minute reaction. A 3-minute 
25 elongation step at 72*C ended each cycle. 

Amplification products were analyzed on 1 to 3% agarose gel-TAE. 
Nu Sieve or Seaplaque agaroses (FMC, Rockland. ME) were used for 
preparative purposes. Specific PCR products were recovered from the gel. by 
the use of Uie Gene Clean II kit (Bio 101, U Jolla. CA). by Spin X 
30 centrifugation (Costar, Cambridge. MA) or by standard phenol/chloroform 
extractions of melted gel slices. 
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Subcloning and Sequencing. Small (250-330 bp) amplified genomic 
fragments were directly subjected to automated sequencing, using the PCR 
primers as DNA sequencing primers. 

PCR-genetated cDNA products were blunt-ended using Klenow 
5 fragment and 5' phosphorylated by T4 DNA kinase (Boehringer Mannheim. 
Indianapolis. IN). They were then cloned into the Smal site of the phagemid 
pBluescript II SK (Stratagene. La JoUa, CA). The double-stranded plasmid 
was sequenced using the SK and KS sequencing primers. 

In all cases. Sequencing reactions were carried out on an Applied 
10 Biosystems 373A Automated DNA Sequencer utilizing ABI's DyeDeoxy 
Terminator Sequencer kit (Foster City, CA) and thermal cycling with Taq 
DNA polymerase (Promega. Madison. WI) as previously described (Tracy, 
T.E. etal., Biotechniques 11:6&-14 (1991)). To avoid errors, for each 
species, samples derived from different amplification/cloning experiments were 
15 prepared and both strands were sequenced. 

Mammalian Expressbn; Bioactivity. Vectors containing full-lengtfi 
mammalian Epo coding sequences were subcloned into pSG5 plasmid 
(Pharmacia, Piscataway, NJ) and were transiently expressed in Cos7 cells. 
72-hour supematants were tested for their ability to sustain cellular 
20 proliferation of the Epo-depcndent HCD57 cell line (Sawyer, S.T. et al.. 
Blobd 72 (suppl. 1): 132a (1988)). 

Results and Discussion 

Because of the known biological cross-reactivity between various 
mammalian Epos and the presumption of strong homology in the coding 
25 sequences, a logical cloning strategy would be to screen cDNA libraries with 
moderate to low stringency. However. Epo mRNA is expressed at barely 
detectable levels in all tissues except in hypoxic kidney and liver (Goldberg. 
M.A. et al., Proc. Natl. Acad. Sci. USA 84: 7972-7976 (1987); Fandrey, J. 
etal.. Blood 81: 617-623 (1993)). Therefore it would be necessary to 
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generate libraries for each of the species of interest. Our primary research 
interest is in structure-function relationships of Epo and therefore we require 
full coding sequences of the mature Epo protein, rather than full cDNA 
sequences. Accordingly, we elected to employ a PCR cloning strategy 

5 predicated on known strong homologies in Epo cDNA sequences that flank the 
region encoding the mature protein. There was sufficient conservation of 
sequence around the NCOl site in the 3' untranslated region (UTR) that a- 
single primer could be used to amplify full length mature Epo coding sequence 
in lat. sheep, pig and cat as well as dog and rabbit (data not shown). 

10 However, we were unsuccessful in finding a universal 5' primer and therefore 
had to generate species specific sequences from amplifications of genomic 
DNA. This two stage PCR strategy proved to be satisfactory in generating 
highly accurate sequence information witii a minimum expendinire of time and 
resources. In all instances, tiiere was full agreement between sequences of 

15 complementary strands. 

Preparation of Mammalian Epo cDNAs 

Primer Design for PCR AmpWtcaHon of Epo. The sequences of Epo 
genes from tiiree species have been already published: two primates, human 
(Jacobs, K. et ai. Nature 31 3:iQ6-m (1985); Lin, F.-K. et aL, Proc. Nail. 

20 Acad. Sd. USA 52:7580-7584 (1985)) and Cynomolgus monkey (Macaco 
fascicularis) (Lin, F.-K. et ai. Gene 44:201-209 (1986)) and a rodem, tiie 
mouse (Mas musadus) (McDonald, J.D. et ai, Mol. & Cell Biol. 6:842-848 
(1986); Shoemaker, C.B. et al., Mol. A Cell Biol. 6:849-858 (1986)). There 
is substantial homology at tiie nucleotide level not only in die coding 

25 sequences, but also in portions of introns and in 5' and 3' untranslated 
sequences of exons 1 and 5. respectively. A number of primers, 
corre^nding to conserved nucleotide sequences between mouse and-human. 
were synthesized and tested for their ability to produce genomic-PCR 
fragments from a wide variety of mammalian cell lines. A 4 kb human 
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genomic clone and genomic DN A extracted from the human hepatoma cell line 
Hep3B were used as control templates for the PGR amplifications. 

In particular, two pairs of primers, IV1/EX2R and EX5/NC01 
(Figure 1), directed the amplifications of fragments of 330 and 250 bp 
5 (respectively) from genomic DNA of all human and mammalian cell lines that 
we investigated. IV 1 is a 25-mer oligonucleotide localized in intron 1, 
216/217 bases upstream from the start of exon 2. Human and mouse are 
identical except that the human sequence contains an extra G nucleotide at 
position 996. IVl primer corresponding to the mouse sequence was 
10 synthesized. The 23-mer EX2R ends 28 nucleotides upstream from the 3' end 
of exon 2. EX5 is a 22-mer, beginning 34 bases downstream from the 5' end 
of exon 5. NCOl represents a 20 bp, 100% conserved DNA fragment, 
starting 1 12/113 bases downstream from the TGA stop codon and contains the 
unique NCOl site present in the human and mouse gene. 
15 An ATG primer was also synthesized, corresponding to a 20 

oligonucleotide stretch, extending both 5' and 3' from the initiator methionine 
codon. This sequence is also totally conserved between primates and mouse. 
While combinations of exonic primers (ATG1EX2R, EX5/NC01 and 
ATG/NCOl) were able to direct amplification from cDNA prepared from 
20 RNA of Uie human Epo-producing cell line Hep3B, no amplification was 
obt^ned on cDNA prepared from uninduced or hypoxia-induced BHK, 
MDOK, LLC-PKl, MDCK, CRF-K. PALl-K and SPl-K cells (results not 
shown). Therefore tiiese renal cell lines apparently do not express ^ 
mRNA, eitfier constitutively or after hypoxic stress, at a level detectable even 
25 by RT-PCR. Furthermore, no Epo protein was detectable by 
radioimmunoassay in tiie supematants of cells msuntained overnight in \% 0,, 
5% COj, 94% Nj at 37X (Goldberg, M.A. c/ ai, Proc. Natl. Acad. ScL 
USA W:7972-7976 (1987)). 

The amplification of species specific 1V1/EX2R and EX5/NC01 
30 genomic fragments allowed us to design a strategy shown in Figure 2 for 
obtaining cDNA clones containing die complete coding sequence of mature 
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Epo protein from various mammals. The IV1/EX2R primer pair resulted in 
a genomic fragment ( - 330 bp) corresponding to the 3' third of the first intron 
and 80% of exon 2, encoding the COOH-end of the propeptide and the NHj- 
terminal amino acids of the mature protein. EX5/NC01 primers amplified a 

5 segment of exon 5, containing the COOH-end of the Epo protein and a 3' 
untranslated sequence of about 140 bp downstream of the stop codon: DNA 
sequencing of these 5' and 3' fragments permitted, if necessary, the design of • 
a second generation of species-specific primers (SP), localized outside the 
coding pan of the mature protein (i.e., in the sequences encoding the 

10 propeptide and the 3' untranslated region). 

Amplification of Partial Intron If Exon 2 Genomic Clones. We first 
explored the efficacy of the IVl and EX2R primers for amplification of 
genomic fragments from different purified DNA templates. In all the 
reactions, fragments of predicted size were the main (and. when using the 

15 most stringent annealing temperature, often the only) products detected by 
agarose gel electrophoresis. The (direct) sequences of these PCR-generated 
fragments are presented in Figure 3. The nucleotide alignment includes 6 
different orders of mammals: one primate, human (Homo sapiens); two 
artiodactyls. sheep (Ovis aries) and pig (Sins scrofa); oneperissodactyl, horse 

20 {Equus caballus); one cetacean, spotted dolphin (Stenella plagiodon); three 
canjivores, dog (Canis famiUaris), cat (Felis catus) and lion {Panthera leo)\ 
two rodents, mouse {Mus musculus) and hamster (Cricetus cricetus). Cat and 
lion exhibited an almost complete sequence identity, with only one T to G 
nucleotide substitution near the 5' end of the amplified portion of the first 

25 intron. 

The ability of the intronic IVl to anneal to genomic DNA from various 
species and. in combination with EX2R, to amplify PCR fragments of identical 
size, demonstrated a high degree of conservation of sequence and position 
between mammals. This suggests that IVl sequence may be involved to some 
30 extent in the regulation of the Epo gene expression. The comparison of 
IV1/EX2R sequences also revealed two remarkably conserved intronic 



wo 94/24160 



PCT/US94/04361 



-37- 

sequences: AT(T/A)GAATGAA(G/C)GC [SEQ. ID NO: 1] (nucleotides 1046 
to 1058 in the human gene) and A(A/DGGTN(G/C)GGG [SEQ. ID NO: 2] 
(nucleotides 1085 to 1094). 

AmpUficadon of Partial Exon 5 Fragments from Genomic DNAs. 
5 PGR reactions were also performed applying the EX5/NC01 primer pair to 
various purified genomic DNAs. As previously observed for the amplification 
of 1V1/EX2R, in all the tested samples a unique main band of about 250 bp 
(as predicted for human and mouse genes) was detected on analytical agarose 
gel electrophoresis. Direct DNA sequencing of these fragments demonstrated 
10 that the bulk of the purified PGR product corresponded to the expected Epo 
exon 5 sequence. However, for some DNA samples, purified from horse 
kidney and several cell lines (BHK and LLC-PKl in particular), unknown 
sequences were present to various extents. These minor contaminating PGR 
products generated extraneous peaks in the sequence data, resulting in 
15 ambiguities at several positions. Nevertheless, computer-edited analyses of 
generated 3' non-coding sequences were sufficient to design, if necessary, 
specific primers with greater than 95% accuracy (data not shown). 

Cloning of Partial cDNAS Encoding the Complete Mature Epo 
Protein. As we previously mentioned, we were unable to obtain any 
20 amplification with cDNAs prepared from renal-derived mammalian cell lines. 
Therefore, we tested our battery of primers on reverse-transcribed RNAs 
prepared from kidney of several mammals. We have successfully amplified 
Epo cDNA from: one primate. Rhesus monkey (Macaca mullata), one 
carnivore, cat (Felis catus), one rodent, rat (Rattus norvegicus) and two 
25 artiodactyls, pig {Sus scrofa) and sheep (Ovis ones). The aligned nucleotide 
sequences are presented in Figure 3. They have been submitted to GenBank 
and have been assigned accession numbers. 

Rhesus monkey, rat and sheep cDNAs were amplified using the 
ATG/NCOl primer-pair combination. Pig and cat sequences were obtained 
30 by the use of a 5' 24-mer specific primer (SPl-5' 
TGCTTGTGCTATCTTTGCTGaGC 3') [SEQ. ID NO: 3]. IV1/EX2R 
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sequencing demonstrated that this exon 2 sequence was shared by the two 
species. In all PGR amplifications, NCOl was used as the reverse primer. 
Conservation of the NCOl sequence among mammals suggests a potential 
biological role of this nucleotide sequence (perhaps in modulating the stability 

5 of Epo mRNA) (Rondon, I. J. etai, J. Biol. Chm. 266:16594-16598 
(1991)). Considerable homology is also found in the sequenced 3' 
untranslated segment of exon 5 (Figure 4). 

When compared with human Epo. the overall percentages of nucleotide 
identity are, respectively: 93.5 for the Rhesus monkey, 86.7 for the cat, 85.7 

10 for the pig. 83 for the sheep and 75.5 for the rat. Rhesus monkey and 
Cynomolgus monkey (Lin, F.-K. et al. , Gene 44:201-209 (1986)) were 99.6% 
identical. Mouse (McDonald, J.D. etai, Mol. &. Cell Biol. 6:842-848 
(1986); Shoemaker. C.B. et al., Mol. & Cell Biol. 6:849-858 (1986)) and rat 
nucleotide sequences showed greater than 93% identity, the two sequenced 

15 artiodactyls, pig and sheep, showed 88% identity.' 

Comparison o/MammaUan Epo Primary Sequences 

Propeptides. The predicted amino acid prqjeptide sequences of several 
mammals are presented in Figure 5. In the primates, there was strong 
conservation of die sequences of deduced propeptides, with only two amino 

20 acid (aa) substitutions at positions -11 (Leu in humans vs. Val in both 
monkeys) and -2 (Leu in humans vs. Pro in both monkeys). Expression of 
Cynomolgus monkey Epo gene in cultured mammalian cells resulted in the 
production of mature monkey Epo that was elongated at the N-terminus by 
three additional residues: Val-Pro-Gly (Lin, F.-K. et al.. Gene 44:201-209 

25 (1986)). As the Rhesus monkey has die same substitutions, an identical site 



' Several PCR attempts on total RNA purified from the hoi-se's kidney 
were unsuccessful, even when using 5' and 3' ^ific equine primers (SP), 
presumably because analysis on 1% formaldehyde agarose gel demonstrated 
the degradation of die RNA. 
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10 



of cleavage is likely. The Leu to Pro amino acid replacement at -2 is 
probably responsible for the differential activity of the signal peptidase 
observed between man and monkeys. The three rodents, mouse, rat, and 
hamster, were found to have identical propeptide amino acid sequences. 

Mature protein. A comparison of the three previously published and 
our five deduced amino acid sequences of mature Epo proteins is shown in 
Figure 6. Erythropoietin is highly conserved among mammals. The amino 
acid alignment showed that more than 63% of the molecule is composed of 
invariant amino acids (106 residues) (Figure 6A). Most of the observed 
substitutions are conservative, involving residues with similar physical and 
chemical properties. The calculated percentages of sequence identity are 
shown in Table II below. 



15 



20 



Table n 

Degree of conservation among nuunmals 
of the mature Epo proteins 

The percentages of identity among the various sequences were 
determined from the amino acid alignment reported in Figure 6. 





% identity 


Primates 


ManTRhesus Monkey 


90.5 




Rhesus/Cynomolgus 


98.8 


Rodents 


man/mouse 


79.8 




man/rat 


82.1 




mouse/rat 


93.5 


Artiodactyls 


man/sheep 


81.0 




man/pig 


8L6 




sheep/pig 


88.7 


Carnivores 


man/cat 


84.5 



There is 100% conservation of Cys7 at the N-terminal portion of the 
polypeptide and Cysl61 near the C-terminus. This disulfide bridge is essential 



wo 94/24160 



PCTAJS94/04361 



.40- 

to the formation of a stable and functional cytokine (Wang. F.F. et aL, 
Endocrinology 776:2286-2292 (1985)). In contrast, while Cys29 is invariant 
in all the species we examined, Cys33 is present only in primates, sheep, pig 
and cat, but not in the two rodents where there is a proline. The lack of 

5 functional importance of this short disulfide loop is underscored by muteins in 
which tyrosine substitutions at each or both of these sites have no effect on 
Epo's biological activity (See Example II). 

There is also 100% conservation of all three asparagine residues 
responsible for N-linked glycosylation. Even though Epo, deprived of its 

10 carbohydrate either by enzymatic cleavage (Takeuuchi, M. et ai, /• BioL 
Chem, 265:12127-12130 (1990)) or by production in bacteria (Nahri, L.O. 
etaL, J. BioL Chem. 266:23022-23026 (1991)) has full in vitro biological 
activity, survival of the hormone in the circulation depends upon N-linked 
glycosylation (Spivack, J.L. et a/.. Blood 75:90-99 (1986)). In contrast, the 

15 O-linked glycosylation site (Serl26 in human Epo) is not essential for 
biological function since it is missing in mouse and rat Epos. 

As shown in Figure 6, there is very high conservation of sequence in 
regions which, by algorithms based on primary structure, are predicted to be 
alpha helices (See Example II). The few differences in sequence widiin these 

20 helical regions are conservative rq)lacements. Indeed, some sites within these 
predicted alpha helices arc conserved in corresponding helices of other 
cytokines (Manavalan, P. etal, J. Prot. Chem. 77:321-331 (1992)). In 
contrast, regions predicted to be inter-helical loops are less well conserved, 
particularly the 14 residue stretch between residues 116 and 130 of human 

25 Epo. In Example II a mutein with a deletion of residues 111 through 119 is 
shown to have normal stability and biological activity, whereas a deletion of 
residues 122 through 126 fails to produce a detectable protein, probably owing 
to markedly impaired stability. 

Similar overall 4 a-helical bundle structure exists or is predicted for 

30 several other cytokines/hormones, such as growth hormone (GH), granulocyte- 
macrophage colony-stimulating factor (GM-CSF), IL-3, IL4 and IL-5 (Bazan. 
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J.F. et fl/., ImmunoL Today 11: 350-354 (1990)). Like Epo. GH exhibits a 
high degree of primary sequence conservation and biological crossreactivity 
between mammals. The mouse, rat, pig and sheep proteins show about 80% 
amino acid identity with human GH. In other cytokines, however, there is 

5 more sequence diversity among species. For example, the human and murine 
forms of IL-5, GM-CSF and IL-3 show respectively only 69. 67 and 26% 
amino acid homology. Furthermore, unlike Epo or GH, those proteins lack 
inter-species biological cross-reactivity. 

Expression of the Mammalian Epo Hormones. Mammalian Epo 

10 cDNAs were subcloned into the pSG5 plasmid and transiently expressed in the 
monkey Cos? cell line. Supematants of the transfected cells were able to 
sustain the cellular proliferation of the murine Epo-dependent HCD57 cell line, 
demonstrating the biological cross-reactivity between species. As the human 
Epo antisera used in our radioimmunoassay bind to Epos from other species 

15 with variable affinity, the amount of produced protein was difficult to 
determine accurately. Experiments are in progress to further characterize the 
relative affinities of the various mammalian erythropoietins toward murine and 
human Epo receptors. 

Phylogenetic Analyses of Epo Sequences. Nucleotide sequences 

20 encoding the full length mature Epo protein were analyzed by the maximum 
parsimony method (Czelusniak, J. et aL.Meth. EnzymoL J85:601-615 (1990); 
Stanhope, M.J. et al, MoL Phylog. Evol. 7:148-160 (1992)), An "all trees" 
set of computer programs determined the parsimony length of each of the 945 
unrooted trees formed by the seven cDNA sequences (the terminal taxa) 

25 (Figure 4) and, on ordering those 945 trees according to increasing length, 
identified the minimum number of extra nucleotide substitutions needed to 
break up each group in the maximum parsimony (lowest length) tree. As 
shown in Figure 7 and 7A, the mouse and rat are strongly grouped, as are the 
human and monkey. The pig and sheep are more weakly grouped, in accord 

30 with accepted phylogenetic views, which place the divergence of these animals 
about 55-60 million years ago. As expected with the parsimony reconstruction 
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on only the cxxion sequences, silent base substitutions occurred much more 
readily than amino acid replacements (Figure 7B). 

The Epo intron 1-exon 2 sequences shown in Figure 3 were also 
analyzed by the maximum parsimony method. Figure 8 shows strength of 

5 grouping results based on all 135, 135 unrooted trees formed by nine terminal 
taxa (in this case, cat and Hon sequences, which are nearly identical, were 
treated as a single taxon), and Figure 8A shows a phylogenetic tree that- 
combines maximum parsimony with established phylogenetic evidence. As the 
most parsimonious tree (Figure 7) for these relatively short sequences failed 

10 to join dog to the feloid (cat and lion) branch (reflecting the fact that the 
ancestral split between canoids and feloids traces back to the early carnivores 
at about 60 million years ago), the phylogenetic tree shown in Figure 7A is a 
near most parsimonious tree in which dog joins the feloids. The rodents 
(mouse and hamster) are strongly grouped, as are the feloids (cat and lion), 

15 and at more moderate strength the cetacean (dolphin) is grouped with the 
artiodactyls (sheep and pig). There are other molecular data as well as some 
paleontological data that depict cetaceans originating from early artiodactyls 
(Czelusniak, J. etaL Meth, EnzymoL 783:601-615 (1990); Czelusniak. J. 
. etaL, In "Current Mammology", Volume 2, 3rd ed.. H. H. Ganoways. 

20 pp. 545-572, Plenum Publ. Corp. (1990); Irwin, D.M. etaL, J. MoL 
EvoL J2: 128-144 (1991); Gingreich, P.D. et aL, Science 249:154-151 (1990); 
Novacek, M.J., Nature 356:121-125 (1992)). 
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EXAMPLEII 

Structure Function Relationships of Erythropoietin: 
Muteins thai Test a Model of Tertiary Structure 

Summary 

5 On the basis of its primary sequence and the location of its disulfide 

bonds, we propose a structural model of the erythropoietic hormone 
erythropoietin (Epo) which predicts a 4 a-helical bundle motif, in common 
with other cytokines. In order to test this model, site directed mutants were 
prepared by high level transient expression in Cos? cells and analyzed by a 

10 radio-immune assay and by bioassays utilizing mouse and human Epo 
dependent cell lines. Deletions of 5 to 8 residues within predicted a helices 
resulted in the failure of export of the mutant protein from the cell. In 
contrast, deletions at the N-terminus (Al-6), the C-terminus (A163-166), or in 
predicted interhelical loops (AB: A32-36, A53-57; BC: A78-82; CD: 

15 Al 1 1-119) resulted in the export of immunologically detectable Epo muteins 
that were biologically active. The mutein A48-52 could be readily detected 
. by RIA but had markedly decreased biological activity. However, 
replacement of each of these deleted residues by serine resulted in Epo 
mutpins with full biological activity. Replacement of Cy^9 and Cys33 by 

20 tyrosine residues also resulted in the export of fully acdve Epo. Therefore this 
small disulfide loop is not critical to Epo's stability or function. The 
properties of the muteins that we tested are consistent with our proposed 
model of tertiary strucnjre. 
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Introduction 

Humoral regulation of red blood cell production was first proposed at 
the beginning of this century (Camot, P., et al.. C.R. Seances Acad. 
ScL 7^5:432-35 (1906)). Convincing physiologic experiments documenting 
5 the existence of erythropoietin (Epo) (Krunidieck. N., Proc. Soc. Exp. Biol. 
Med54'.Wn (1943); Reissmann. K.R., Blood 5:372-3^0 (1950); Erslev, A.. 
Blood 8:349-357 (1953); Jacobson. L.O.. et al.. Nature 779:633-634 (1957)) 
were followed by its purification (Miyake, T., etal.. J. Biol. 
Chem. 252:5558-5564 (1977)) and partial structural characterization (Lai. P.- 
10 H., et al., J. Biol. Chem. 26i:31 16-3121 (1986)). The molecular cloning of 
this biologically and clinically important cytokine (Jacobs, K., etal.. 
Nature 313:906-m (1985); Lin. F.-K., etal.. Proc. Nat. Acad. Sci. 
USA 82:7580-7585 (1985)) has led to further understanding of its properties 
(Sasaki, H., et aL, J. Biol. Chem. 262:12059-12076 (1987); Davis, J.M., 
15 etal.,Biochenustry 26:2633-263% (mi)). 

The binding of Epo to its cognate receptor (D' Andrea, A.D., et al. 
Cell 57:277-285 (1989)) on crythroid progenitors in the bone marrow results 
in salvaging these cells from apoptosis (Kouty, M.J., etal.. 
Sdence 248:378-381 (1990)). allowing them to proliferate and differentiate 
20 into circulating erythrocytes. The Epo receptor is a member of an ever 
enlarging family of cytokine recq)tors (Bazan, F., Biochem. Biophys. Res. 
Commun. 764:788-796 (1989)). In like manner. Epo shares weak sequence 
homology with oUier members of a family of cytokines which also include 
growth hormone, prolactin, 11^2. IL-3, IL4, IL-5. IL-6, IL-7. GCSF. GM- 
25 CS¥, M-CSF, oncostatin M, leukemia inhibitory factor and ciliary 
neurotrophic factor (Parry. D.A.D., etal., J. Mol. Recognition 7:107-110 
(1988); Bazan. F.. Immunology Today 77:350-354 (1990); Manavalan. P.. 
etal.. J. Protein Chem. 77:321-331 (1992)). The genes encoding tiiese 
proteins have similar numbers of exons as well as a clear relationship between 
30 intron-exon boundaries and predicted a-helical structure. These similarities 
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have led to the prediction that this family of cytokines share a common pattern 
of folding into a compact globular structure consisting of four amphipathic 
a-helical bundles. Such theoretical models of the structures of human growth 
hormone (Cohen. F.E.. etaL, Proteins: Struct. Func. Genet. 2:162-166 

5 (1987)) and IL-4 (Curtis. B.M., et aL Proteins: Struct. Func. 
Genet. Ji:l 11-119 (1991)) have been in remarkably good agreement with 
subsequent structures established by X-ray diffraction (HGH) (Abdel-Meguid. 
S.S., et fl/., Proc. Nail. Acad. ScL USA 54:6434-6437 (1987); deVos, A.M.. 
et ai. Science 255:306-312 (1992)) or by multidimensional NMR (IL4) 

10 (Redfield. C, et aL Biochemistry 30.11029-11035 (1991); Powers. R., et al, 
Science 256:1673-1677 (1992)). Moreover, the crystal structures of GM-CSF 
(Diederichs, K., et aL, Science 254:1779-1782 (1991)) and monomeric M- 
CSF (Pandit, J., et aL, Science 258:1358-1362 (1992)) are also in reasonable 
agreement with their predicted structures. 

15 Thus far, the structure of Epo has not been analyzed by either X-ray 

diffraction or by NMR. In order to begin to gain an understanding of 
structure-function relationships, we have taken a three-pronged approach: 

a) Sequence determination of Epo from mammals of different 
orders in order to establish regions of homology. This work is described in 

20 Example 1. 

b) Construction of a model of the three-dimensional structure of 
Epo, followed by the design and preparation of muteins that test this model. 
These experiments are presented in tiiis Example. 

c) Design and testing of muteins that provide information on 
25 receptor binding domain(s). This work is presented in Example III. 
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Materials and Methods 

Computer-based Modeling of Structure 

Prediction of Secondary Structure. Epo sequences from human, 
monkey, mouse, rat, sheep, pig. and cat were aligned (See Example I), and 

5 examined using a hierarchical approach to secondary structure prediction that 
assumes that these proteins are members of the da folding class (Levitt, M. 
et al.. Nflft/re 267:552-558 (1976)). First, thepattem-based method of Cohen. 
F.E. et al.. Biochemistry 25:266-275 (1986) for turn prediction was used to 
delimit sequence blocks likely to contain secondary structure. Predictions using 

10 the methods of Gamier. J. et al.. J. Mol. Biol. 720:97-120 (1978) and Chou. 
P.Y. etal., Ann. Rev. Biochem. -#7:251-276 (1978) suggested o-helical 
regions within these blocks. Finally, helical wheel projections were used to 
examine and then limit helix length based on preserving amphipathic character 
as codified in the work of Presnell. S.R. et al., Biochemistry 57:983-993 

15 (1992). The locations of glycosylation sites were also used to suggest helix 
boundaries. 

Tertiary Structure Prediction. Earlier investigations have revealed the 
general principles of helix-to-helix packing in globular proteins (Richmond. 
T.J'.. etal., J. Mol. Biol. 779:537-55 (1978)). Exploring these principles. 

20 Cohen, F.E. et al., J. Mol. Biol. 732:275-288 (1979) developed a method for 
the generation of three-dimensional protein structures from the secondary 
structure assignment. These methods have been applied to myoglobin, tobacco 
mosaic virus coat protein, growth hormone, a- and /3-interferon, IL-2.and IL-4 
(Cohen. F.E.. et al.. J. Mol. Biol. 752:275-288 (1979); Cohen, F.E., et al.. 

25 Science 254:349-352 (1986); Cohen. F.E.. et al.. Proteins: Struc. Func. 
Gen. 2:162-166 (1987); Sternberg. M.J.E., etal.. Int. J. Biological 
Macromolecules 4:137-144 (1982)). 
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The algorithm for tertiary structure generation is divided into four 
computations. The program aapatch identifies clusters of hydrophobic residues 
within the putative helices that could mediate helix-helix interactions 
(Richmond. T.J., et aL MoL Biol 779:537-55 (1978)). Aafold generates 

5 all possible helix pairings according to the location and geometric preferences 
of the interaction sites. Aabuild generates the three-dimensional models of all 
possible strucmres from the list of helix pairings (from aafold) and subject to 
stearic restrictions and geometric constraints on chain folding. In the final 
step, aavector applies the user defined distance constraints (e.g., disulfide 

10 bridges) to the structures generated. At this stage, coordinates have been 
specified only for residues in the core cr-helices. For residues in sequentially 
distinct loops, lower bounds on the inter-residue distances can be inferred from 
the relevant helix terminus. 

Preparation of Epo Muteins 

15 Construction of the Mutagenic/Mammalian Expression Plasmid. A 

M13 plasmid, containing a 1.4 Kb EcoRI-EcoRI human Epo cDNA insert 
(XHEpo FL12) was a gift from Genetics Institute (Cambridge, MA) (Jacobs, 
K., ,et ai. Nature 373:806-810 (1985)). A 943 bp EcoRI-Bglll fragment, 
corresponding to the complete coding sequence of the wild type human 

20 erythropoietin, including untranslated regions 216 bp upstream and 183 bp 
downstream, was inserted into the mammalian expression plasmid pSG5 
(Stratagene) (Sambrook, J., etaL, In "Molecular Cloning: A Laboratory 
Manual", 2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, NY. 
pp. 1.38-1.41 (1989)) and named pSG5-Epo/WT. 

25 Site-Directed Mutagenesis was carried out according to the protocol 

described by Kunkel, T.A., etaL, Meth. EnzymoL i«:367-382 (1987). 
Single-stranded DNA was rescued from the pSG5-Epo/WT phagemid grown 
overnight in E. coli CJ236, in 2XYT media containing M13K07 helper phage 
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(In Vitiogen) and 70 /xg/ml kanamacyn (Sigma). The resulting uracil- 
containing ssDNA was used as a template for mutagenesis. Oligonucleotides 
(24 to 46 mer) were synthesized with their 5' and 3' ends complementary to 
the target wild type Epo sequence. A large variety of mutations (base 
5 substitutions, deletions and insertions) were created at the centers of the 
mutagenic primer sequences. Annealing of the phosphorylated primers (10: 1 
oligonucleotide/DNA template molecular ratio) was performed in 10 /il of a 
20 mM Tris-HCI pH 7.4. 2 mM MgClj, 50 mM NaCl solution. The reacUons 
were incubated at SO'C for 5 minutes, then allowed to cool slowly to room 
10 temperature over a one hour period. The DNA polymerization was initiated 
by the addition as a mix of 1 ;xl of lOX synthesis buffer (100 mM Tris-HCL 
pH 7.4, 50 mM MgClj. 10 mM ATP. 5 mM each dNTPs, 20 mM DTT), 0.5 
^\ (8 units) of T4 DNA ligase and 1 /xl (1 unit) of T4 DNA polymerase 
(Boehringer Mannheim). After 2 hours at 37'C. 80 /tl of IX TE was added. 
15 5 n\ of the diluted reaction mix was used to transform competent E. cott 
NM522 (ung*. dut*). 

Since a 40-80% mutation yield is normally obtained, 4 to 5 double- 
stranded plasmid clones from each reaction were sequenced with 7-deaza- 
dGTP and Sequenase (U.S. Biochemical) (Sanger, F., etal., Proc. Natl. 
20 Acad. Sci. USA 7^:5463-5467 (1977)). As a rule, the entire coding sequences 
of the 1^ mutants were examined for the presence of unwanted mutation by 
sequencing or restriction enzyme mapping. 

Production of Wild Type Epo and £jpo Muteias in Mammalian Cells. 
Cos7 cells grown to approx. 70% confluency were transfected with lO ng 
25 recombinant plasmid DNA per 10 cm dish using the calcium phosphate 
precipiution protocol (Kingston. R.E., etai. In "Currem Protocols in 
Molecular Biology", Green Publishing Associates & Wiley Intcrscience, NY, 
pp. 9.1.1-9.1.9 (1991)). As a control of transfection efficiency, in several 
experiments 2 /tg of pCHllO plasmid (Pharmacia) was co-transfected and P- 
30 galactosidase activity measured in the cytoplasmic extracts. 
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RNA BIct-Hybridization Analysis. Total RNAs were prepared from 
cultured Cos7 cells (Chirgwin, J.M., etaL, Biochemistry 78:5294-5299 
(1979)) and 2 |xg samples electrophoresed on a 1.1% agarose gel containing 
2.2 M formaldehyde. Transfer to GeneScreen Plus filters (New England 
5 Nuclear) and hybridization with a ^P-labeled wt Epo probe were carried out 
as previously described (Faquin, W.C., et aL Blood 79:1987-1994 (1992)). 

Quantitation of Tmnsiently-Expressed Recombinant Epos. The 
amount of secreted protein in the supematants of transfected Cos7 was 
determined by a radioimmune assay (RIA). The RIA was performed using a 
10 high titer rabbit polyclonal antiserum raised against the human wild type Epo 
and produced in our laboratory, *^I-labeled recombinant Epo was obtained 
from Amersham. Details of the protocol have been published elsewhere 
(Faquin, W.C., etal.. Exp. HematoL 21: 420-426 (1993)). 

Immunoprecipitation of^S-labeled Epo Proteins. Three days after 
15 transfecuon, the Cos7 monolayers were washed extensively with IX PBS and 
the cells incubated for 20 minutes at 37**C in 2 ml of MetVCys' Minimum 
Essential Media Eagle modified. In each culture dish, 100 /il of TRAN35S- 
LABEL (p*S]-cysteine and p^SJ-methionine, - 10 mCi/ml, ICN Biochemical) 
was then added. After two hours, the conditioned media were harvested and 
20 cellular extracts were prepared by lysis in radioimmune precipitation buffer 
(50 mM Tris-HCl pH 8.0, 150 mM NaCl, 0.02% (w/v) sodium azide, 0.1% 
(w/v) SDS. 0.5% (w/v) sodium deoxycholate, 1 % (w/v) Triton X-100, 1 mM 
phenylmethanesulfonyl fluoride and 1 fxg/ml aprotinin). Samples were 
precleared with rabbit preimmune serum/protein A-Sepharose CL-4B 
25 (Pharmacia) for two hours. Immunopreclpitations were performed overnight 
with our polyclonal antibody specific for human recombinant wild type Epo 
and immunoadsorbed with protein A-Sepharose CL-4B. Immunoprecipitates 
were run on 15% SDS-polyacrylamide gels (Laemmli, U.K., Nature 
227:680-685 (1970)) and analyzed by autoradiography after treatment with 
30 Enhance (New England Nuclear). 
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Bioassays. The dose-dependent proliferation activities of wild type Epo 
and Epo muteins were assayed in vitro using three different target cells: 
murine spleen cells, following a modification of the method of Krystal 
(Krystal, G.. Exp. Hematol. 77:649-60 (1983); Goldberg. M.A.. et al.. Proc. 

5 Natl. Acad. Sci. USA 5^:7972-7976 (1987)); Epo responsive murine 
erythroleukemia cell line, developed by Hankins (Hankins. W.D., et al.. 
Blood 70-. 173a (1987)); and human ^xniependent UT-7/Epo cell Une, derived ■ 
from the bone marrow of a patient with acute megakaryoblastic leukemia 
(Komatsu, N., et al.. Cancer Res. 57:341-348 (1991)). After 22 to 72 hours 

10 of incubation with increasing amounts of recombinant proteins, cellular growth 
was determined by pHJ-thymidine (New England Nuclear) uptake or by the 
colorimetric MIT assay (Sigma) (Yoshimura, A., etal., Proc. Natl. Acad. 

Sd. USA 87:41394143 (1990)). 

Bac/«ria/ Erprwston. The wild type Epo target, corresponding to the 

15 nucleotide sequence coding for the mature protein, was PCR-amplified using 
appropriate primers. In the sense primer an Ndel site (CATATG) was placed 
immediately 5' to the Alal codon of the mature protein. In the antisense 
primer a Bglll site was placed 3' to the TO A stop codon. After enzymatic 
digestion, the 516 bp PCR fragment was inserted in an Ndel/BamHI-cut 

20 pET16b plasmid (Novagen), which has a T7 promoter followed immediately 
by the lac curator. IPTG induction of transformed £. coff BL21 (DE3) (T7 
RNA polymerase*, lotf, ompT) resulted in high levels of expression of a 
fusion protein with a ten histidine stretch at the amino terminus. The oligo- 
His tag allowed the binding of the produced (His,o)-Epo on a nickel affiiuty 

25 resin and its elution by increasing imidazole concentrations in presence of 
phenylmethylsulfonyl fluoride (Sigma). Most of the produced protein formed 
insoluble aggregates and was solubilized and affinity-purified under denaturing 
conditions in 6 M guanidine-HCl. Oxidative refolding was performed by 
overnight dialysis at 4»C against 50 mM Tris-HCl pH 8.0, 40 /tM CUSO4 and 

30 2% (wt/vol) Saricosyl. Soluble protein was further dialyzed against 20 mM 
Tris-HCl pH 8.0, 100 mM NaCl, 2 mM CaCl, and subjected to factor Xa 
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(New England Biolabs) cleavage to remove the NH^-polyHis sequence. 
Monitoring of the fusion protein following induction and during the various 
steps of purification was done by electrophoresis on a 15% polyacrylamide- 
SDS gel, stained with Coomassie Brilliant Blue. Alternatively, the His-Epo 

5 fusion protein was detected on Western Blot (Gershoni, J.M.. et al, Anal 
Biochem. 131:1-15 (1983)), using a 1/2000 dilution of our wt native Epo 
polyclonal antibody and a second bioiinylated rabbit-specific antibody which • 
is detected with a streptavidin-alkaline phosphatase conjugate (Amersham). 
In Vitro Tmnscription/Translatbn. Sense and antisense primers, 

10 creating new Bglll sites respectively 5' and 3' of the initiator and stc^ codons. 
were used in a polymerase chain reaction on pSG5-Epo/WT template. After 
Bglll cleavage, the 594 bp PCR-fragment was subcloned into p SP64T (Krieg, 
P. A., etai, NucL Acid. Res. 72:7057-7070 (1984)). This SP6 containing 
vector provides 5' and 3' flanking regions from Xenopus ^-globin mRNA, 

15 which allow efficient in vitro transcription/translation. Previous experiments 
showed poor yields of in vitro translated protein, when using the GC-rich 
natural 5' Epo untranslated region (UTR). One step in vitro transcription/ 
translation was carried out by incubation of 1 /xg of circular p64T-Epo in a 50 
/a1 reaction volume of SP6-TnT coupled rabbit reticulocyte lysate system 

20 (Promega), in presence of p'5]-cysteine (1200 Ci/mM, New England 
Nuclear). In some cases, canine pancreatic microsomal membranes were 
added to the reaction mix. A purified GST-human Epo receptor extracellular 
domain fusion protein (EREx) was a gift from W, Harris and J. Winkelman, 
and the binding of the ^S-labeled translation products onto EREx-glutathione 

25 agarose beads were performed as described (Harris, K.W., etaL, 7. Biol. 
Chem. 267:15205-15209 (1992)). 
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Results 

Construction of a Model of the Three-Dimensional Structure of 
Erythropoietin 

From an analysis of the putative Epo helix sequences, aapatch 

5 identified eight possible helix-helix interaction sites. In principle, these sites 
could be used to generate 1 .6 x lO" structures. Of these, only 706 maintained 
the connectivity of the chain and were sterically sensible. These structures 
resembled four helix bundles, an increasingly common motif in protein 
structure (Presnell, S.R., et al. Proc. Natl. Acad. Sd. 86:6592-65% (1989)). 

10 The structures that were not compatible with the native disulfide bridge 
between Cys, and Cys,4, were eliminated. This reduced the total number of 
structures from 706 to 184 (total computer time ^proximately 1 hr. on a 
Silicon Graphics IRIS 4D/35G), The remaining structures were then rank 
ordered by solvent-accessible surface contact area, a measure of the validity 

15 of model structures. The most compact structures were right handed, all anti- 
parallel four-helix bundles wiUi no overhand connections, but this may be an 
artifact of a failure to add tiie polypeptide chain tiiat forms the loops to the 
helical core constructed by aabuUd. The other less compact structures were 
left-handed four-helix bundles witii two overhand loops, a topology previously 

20 seen in d>e strucmres of IL4 and growtii hormone. We suspect tiiat diis is die 
likely structure for Epo. The consensus for assignments of putative a-helices 
in human Epo are summarized in Table III. First, analysis of tiie topological 
distribution of known four-helix bundle structures indicate tiiat neariy all 
examples have an antiparallel orientation (Presnell, S.R., et al, Proc. Natl. 

25 Acad. Sd. 56:6592-6596 (1989)). Second, Uie left-handed four-helix bundles 
witii two overhand connections arranges tiie four amphipatiiic helices to form 
a compact hydrophobic core. Finally, a patii for tiie loop regions of Epo can 
be imagined by analogy to IL-4 and growtii hormone diat preserves tiie 
compact globular nature of die Epo model structure. Figure 9 shows 
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schematic representations of the possible topological interactions between the 
four-anti-parallel a-helical bundles. 







Table III 




Predicted cr-helical regions of the mature erythropoietin protein 


Helix 


N-terminus 


C-terminus 


Potential Helix-Helix 
. Interaction Sites 


A 


9 


22 


19 


B 


59 


76 


63. 67. 70. 71 


C 


90 


107 


95, 102 


D 


132 


152 


141 


Data were obtained using the various algorithms for secondary and 
tertiary structure generations described under Materials and Methods. 



Several authors have suggested that the helical cytokines form a 
structural superfamily (Cohen, RE., etal., Science 254:349-352 (1986); 
Parry, D.A.D., ei cL, J, Mol Recognition 7:107-110 (1988); Bazan. J.F., 

15 Biochem. Biophys. Res. Comm. 764:788-795 (1989); Cosman, D„ etal., 
Trends Biochem. Sci. 75:265-270 (1990); Bazan, J.R, Proc. NaiL Acad. Sci. 
U,S.A. 87:6934-6938 (1990)). On the basis of both the mature protein and the 
individual a-helices, Epo seems to be more closely related to growth hormone, 
prolactin, IL-6 and GM-CSF rather than the other members of the helical 

20 cytokine superfamily. Nevertheless, recent improvements in algorithms for 
the identification of distant evolutionary relationships between proteins from 
structural fingerprints suggested that it might be possible to align the IL-4 
structure to the Epo sequences. The Eisenberg (Bowie, J.U., et aL, 
Science 253:l(A'nO (1991)) structural environment and 3D-1D profile 

25 methods are a powerful tool for recognizing that a sequence is compatible with 
a known structure, e.g., a four-helix bundle. The NMR structure of IL-4 
from Smith, L.J., etaL MoL BioL 224:899-904 (1992) was used to 
construct a 3D-1D profile. A mixture of sequences including four helix 
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bundles. globins, and non-helical structures were aligned against the IL-4 
profile. Not surprisingly, the IL-4 structures from human and mouse gave the 
highest scores 0 = 22.8 and 8.1). However, the other known four-helix 
bundle cytokines known to share a similar fold with IL4, e.g., human growth 

5 hormone (Abdel-Meguid, S.S., tt qL Proc. Nail. Acad. ScL W:6434-6437 
(1987)) (Z = 2.3) and GM-CSF (Diederichs, K., etaL, Science 
254:1779-1782 (1991)) (Z = 2.3) fared no better than some globin sequences 
(Kuroda's and slug sea hare globin, Z = 5.0 and 4.8) that adopt a distinct 
tertiary structure. The results for the human and sheep Epo sequences were 

10 also ambiguous (Z = 1.6 and 0.8). These results suggest that while profile 
methods are a powerful tool for recognizing structural similarity, their failure 
to identify homology does not exclude the possibility that two proteins share 
a common fold. For distantly related or unrelated structures, current profile 
methods cannot replace de novo methods for tertiary structure prediction. 

15 Design and Expression of Epo Muteins that Test the Proposed Structure 

To test the proposed four a-helical bundle structure of erythropoietin 
and at the same time to attempt to locate functional domains, we created by 
site-directed mutagenesis a series of deletion, insertion and replacement 
mutants. These muteins were designed to analyze the principal predicted 
20 structural feanircs of the molecule: a-helices. interconnecting loops, as well 
as' the NH, and COOH termini. Structural and functional implications of the 
disulfide bridges and the glycosylation sites were also investigated. 

a Helices. Short amino acid deletions were prepared in, or close to, 
the predicted A, B, C and D a helices. Human wild type and muteins were 



25 ^Z-scores are used to describe the normalized weight associated with a 
profile score. A distribution is built from a collection of sequences with a 
mean 2 score of 0.0 and a standard deviation of 1.0. Z-scores greater than 
6.0 arc associated with significant alignments. Z-scores between 3.0 and 6.0 
may or may not be structurally relevant. 
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tiansiently expressed in Cos7 cells. Northern blot analyses demonstrated that 
all the mutant plasralds produced about the same amount of mRNA as that of 
the wild type (data not shown). Yet, no detectable amount of Epo protein 
could be found in the Cos? supematants, either by radioimmunoassay or by 
5 bioassay using various ^x>-dependent cell lines. Table IV summarizes these 
findings. 



mutants. 
A 12-16 

A 65-69 



A 96-100 
A 105-109 



A 122-126 

A 131-135 
A 140-144 
A 142-150 
A 152-155 

A 156-160 



TABLE IV 
Short deletions in, or close to, o helices 



9 
i 

22 



59 
76 



90 
107 



A HELIX 



B HELIX 



C HELK 



132 
I 

152 



D HELK 



- RNA levels comparable to WT. 

- no detectable Epo in the Cos7 
supernatant, both by RIA and 
bioassay. 



Predicted N and C termini of each o helix are indicated in the vertical boxes. 

An example of SDS-PAGE of immunoprecipitants from in vivo "S- 
labeling is presented in Figure 10. As expected, when Cos7 cells were 
transfected with pSG5-Epo/WT. a 35-37 KD band was detected in the 
supernatant. In contrast, the deletion mutants (Table IV) could be detected in 
cellular extracts but were not exported from the cells. Figure 10 shows the 
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cytoplasmic retention of the mutein A140-144, lacking four residues in the 
middle of the predicted D helix. The apparent molecular weight (around 28 
IcD) is less than expected for a five-amino acid deletion. Therefore not only 
the secieuon, but also the glycosylation. seem to be impaired. None of these 
5 muteins had deletion of glycosylation sites. It is likely that full glycosylation 
of Epo requires conservation of its molecular architecture. Similar results 
(reported in Table IV) were obtained for all die muteins having partial deletion 

of an a helical peptide segment. 

Because contaminants in crude Cos? cellular extracts severely interfere 
10 with the radioimmunoassay, no direct Epo quantitation was possible. 
However, aliquots of hypotonic extracts of Cos? transfected witii wild type 
Epo were able to sustain HDC57 proliferation. No similar biological activity 
was found for muteins witii limited deletion of a helices. 

Jnunonnecting Loops. The peptide segment joining A and B helices 
15 presents several interesting features (Figure 11). AB loop consists of 36 
amino acids. Two N-glycosylation sites and a small disulfide bridge are 
located in the first half and their biological implications will be discussed later. 
The COOH end of die AB loop contains a stretch of amino acids that is 
strongly conserved among mammals (See Example I). Alignments of human, 
20 monkeys, cat, mouse, rat, pig, and sheep Epos showed a consensus sequence: 
DTKVNFYAWKR(M/I)(E/D)VG (residues 43 to 57) [SEQ. ID NO: 4]. 
Three deletions were constructed: A43-47. A48-52, and A53-5?, and 
tiansiendy expressed in Cos?. The amount of muteins detected by RIA in the 
supematants of transfected cells was 10 to 40% lower tiian observed witii wild 
25 type Epo (Figure llA). Nevertiieless, die diree secreted muteins were 
biologically active. However, because A48-52 exhibited a marked decrease 
of the specific bioactivity. diis site was studied in more detail by means of 
serine replacements. Krystal ex vivo bioassay as well as HCD5? and 
UT-?/Epo in ntro bioassays showed that tiiese Ser mutants had biological 
30 activities similar to diat of wild type (Figure IIB). Therefore, tiie observed 
decreases in botii RIA and bioassay for the diree deletion mutants are likely 
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to be the result of changes of structural conformation. The long length of loop 
AB may be critical for the up-up-down-down topography. A shorter AB 
segment may impose a strain on the intcrhelical connection. Chou and 
Fasman algorithms (Chou, P.Y., etcL Ann. Rev. Biochem. <^7:25 1-276 

5 (1978)) predicted a short jS-sheet structure from residues 44 to 51 (<Pa> ; 
<Pb>, 1.005 < 1.196). The presence of a short region of jS-sheet in the 
connection between helices 1 and 2 (A and B) have been documented in the 
analyses of the three-dimensional structures of IL4 (Redfield, C, et al., 
Biochemistry 30:nCfl9'\m5 (1991); Powers, R., etaL, Science 256:1613- 

10 1677 (1992)). GM-CSF (Diederichs, K., etaL, Science 254:1779-1782 

(1991) ), and monomeric M-CSF (Pandit. J„ etal., Science 258:1358-1362 

(1992) ). In contrast, in human GH a short segment of a helix is found at the 
same location (Abdel-Meguid, S.S., etcl., Proc. Natl. Acad. So. USA 
84:6434-6437 (1987)). The structure/function implications of tiiese short 

15 features are not yet understood. 

Helix B is linked to helix C by a much shorter segment (residues 77 
to 89) and contains in its center the third N-glycosylation site (Asn83). When 
the A78-82 mutein was expressed, a secreted protein was detected in the 
conditioned medium and conferred proliferative bioactivity on Epo-dependent 

20 cell lines (see Figure 16). 

A similar long crossover connection (23 amino acids) is found between 
helix C and helix D. In contrast to what we previously observed for loop AB, 
a iarge deletion of nine residues at position 111-119 or a seven amino acid 
insertion of a myc epitope after residue 116 did not affect tiie secretion of 

25 tiiese mutelns (Figure 12). Furtiiermore, tiicse two proteins had normal 
specific activity, as seen by the ratio of bioassay to RIA. Our rabbit 
polyclonal antibody raised against tiie native form of die human wild type fiiUy 
recognized the two mutants, demonstrating that the overall spatial 
conformation of Epo was well preserved. According to die algoritiim of 

30 Emini, E.A., et al., J. Virol 55:836-839 (1985), die residues 111-119 are 
predicted to be at the surface of die molecule. Primary amino acid alignments 



wo 94/24160 



PCT/US94/04361 



-58- 

of mammalian Epo showed a large variation in the sequence of residues 116 
to 130. including amino acid deletion, insertion, and substitution (See 
Example 1). Surprisingly, when the deletion A122-126 mutein. which 
removed the 0-glycosylation site (Ser 126), was transiently expressed in 
5 monkey cells, protein secretion was inhibited. Both rodents, rat and mouse, 
lack the 0-glycosylation site because of a Serl26 to Pro replacement. 
Furthermore, when a Serl26 replacement mutein was expressed in normal • 
CHO cells. (DeLorme, E.. era/.. Biochemistry 537:9871-9876 (1992)) or 
when wild type Epo was expressed in cells having a defect in 0-linked 
10 glycosylation (Wasley. L.C., et al.. Blood 77:2624-2632 (1991)), neither 
secretion nor biological activity were impaired. Therefore, failure of secretion 
of the A122-126 mutein may be the result of some other structural alteration. 
In particular, the proline residue at position 122 is invariant among mammals. 
NandC Termini. Deletion of residues 2 to 6 only slightiy affected 
15 the processing of a biologically active protein (see Figure 16). This deletion 
may impair cleavage of tiie propeptide, therefore explaining the lower yield 
of secreted Epo mutein in comparison to that of wild type. The fact tiiat die 
mature monkey protein has an elongated (Val-Pro-Gly) N-terminus strongly 
suggests tiiat tiie N-tcrminal part is not involved in die bioactivity of tiie 
20 molecule. Further evidence comes from the results, rqwrted below, on tiie 
N-poly-His-^ fusion protein expressed in E. coli, and also from tiie identical 
binding of w vitro translated «S-labeled wild type Epo onto EREX-glutatiiione 
agarose beads (Figure 13), witii or witiiout addition of canine pancreatic 
microsomal membranes which permit cleavage of tiie propeptide.* 
25 The C-terminal sequence following helix D can clearly be divided into 

two distinct domains, separated by Cysl61. The residues 151 to 161 were of 
^ial interest because diey are highly conserved among mammals (See 
Example I). There are only two substitutions: Lysl64 replaced by a Thr in 



30 



'All tiie mutants described in tiiis example were subcloned into pSPG4T 
plasmid. 
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artiodactyls and cat, and Alal60 is replaced by a Val in mouse Epo. Both the 
A152-155 and the A156-160 muteins remained in the cytosol of the transfected 
Cos? (Table IV), One possible explanation is that the residues 152 to 160 
may, in fact, participate in the D helix. We predict that GlylSl is the break 
5 point of the structure. However, it is possible that this residue causes only a 
bend in the a helical structure and helix D may extend to Glyl58. 

The C-terminal part of the protein (residues 162 to 166) is clearly not 
involved in any structural or functional feature. Thus, the deletion of the four 
last amino acids or the replacement of residues 162-166 by either a KDEL 
10 sequence or a poly-histidine sequence* did not modify the specific activity of 
the erythropoietin (Figure 14). Radioimmunoassay revealed that the secretion 
of the KDEL-tail mutein in the media of transfected cells was 45% less than 
normally obtained with the wild type Epo. However, when compared to the 
wild type, this mutein had more biological activity in the hypotonic Cos? cell 
15 extracts. The KDEL COOH-terminal sequence has been shown to be essential 
for the retention of several proteins in the lumen of the endoplasmic reticulum 
(Andres, D.A.. et cL, J. Biol Oiem. 266:14288-14282 (1991)). 
Nevertheless, because of overproduction in transiently expressed cells, a large 
percentage of recombinant protein escaped into the media. 
20 Disulfide Bridges, Wang et al demonstrated that the biological 

actiyity of Epo was lost irreversibly if the sulfhydryl groups were alkylated 
(Wang, F.F.. et d., Endocrinology ii6:2286-2292 (1985)). The mature 
human Epo has two internal disulfide bonds: Cys?=Cysl61, linking the NHj 
and COOH termini of the protein and a small bridge between Cys29 and 
25 Cys33, Cys33 was previously changed to Pro by site-directed mutagenesis, 
and the resulting protein was reported to have greatly reduced in vitro 
biological activity (Lin, F.-K., Molecular and Cellular Aspects of 



^The poly-His tail wild type mutant was purified by means of nickel 
affinity chromatography which will enable quantitation of cytosolic-retained 
30 mutants. The (His)6 COOH-terminal sequence has been appended to all the 
muteins described in this example. 
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Erythropoietin and Erythropoiesis //8:23-36, NATO ASl Scries, Springer- 
Verlag. Berlin Heidelberg (1987)). However, rat and mouse Epos have the 
same substitution and yet exhibit full cross-species bioactivity. To resolve the 
role of the small disulfide bridge in human Epo function, we created a 
5 C29Y/C33Y double mutation. The resulting mutein was normally processed 
and showed the same in vitro bioactivity as the wild type (Figure 11 A).* 
Furthermore, the deletion of five amino acid residues (A32-36) did not impair 
the secretion of a biologically active mutein. These data demonstrated tiiat only 
die native and fully conserved disulfide bridge Cys7=Cysl61 is cnicial for the 
10 preservation of die molecular structure of erydiropoietin. 

Functional Role of the GlycosyMon. Natural or recombinant human 
Epo is a heavily glycosylated protein; 40% of its molecular weight is sugars 
(Davis. J.M.. etaL. Biochemistry 26:2633-2638 (1987)). The protein has 
three N-linkcd oligosaccharide chains, located at amino acid positions 24 
15 and 38 (in predicted loop AB) and position 83 (in loop BC). It has one 0- 
linked carbohydrate chain at position 126 (in loop CD), which is missing in 
rodents. The role of these sugar chains in tiie biological activity of the human 
hormone has been extensively studied. Site-directed mutagenesis at die N- 
. glycosylation sites demonstrated tiiat even diough die sugars were important 
20 for proper biosyntiiesis and secretion. Uieir removal did not affect in vitro 
activity. This finding was corroborated by several investigators (Yamaguchi, 
K.. et al.. J. Biol. am. 266:20434-20439 (1991); Dub6. S.. et al.. J. Biol, 
dim. 265:17516-17526 (1988)). However. Takeuchi. M.. etal., J. Biol. 
Chem. 265:12127-12130 (1990) found diat N-glycanase digestion results in 
25 almost complete loss of biological activity. In contrast, there is general 
consensus diat glycosylation plays a key role in die biological activity of die 
hormone in vivo. Various reports have demonstrated diat die N-linked sugar 
chains enhance die stability and survival of Bfo in die blood stream (Fukuda. 



*rwo single replacement muteins (C29Y and C33Y) were also stable and 
30 had full biological activity (results not shown). 
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M.N., etai. Blood 73:84-89 (1989); Tsuda, E., etai, Eur. J. Biochem. 
iS8:405-411 (1990)) and protect the hormone against clearance by the liver 
(Sasaki, H.. et aL J> Biol Chem. 262:12059-12076 (1987)), thereby enabling 
the transit of the hormone from its site of production in the kidney to its target 
5 cells in the bone marrow (Spivak, J.L.. et aL Blood 75:90-99 (1989)). 

We expressed the wild type Epo in E. coli. Accordingly, the produced 
protein completely lacks sugar. The pET expression system was used and is 
detailed in the Methods section. IPTG induction of transformed 8L21 (DE3) 
bacterial strain rapidly results in a high level of expression of the poly-His Epo 
10 fusion protein (Figure 15A and 15B). After three hours of induction, we 
obtained a typical yield of - 1 mg of total protein per ml of culture. 
However, the vast majority of the expressed protein was present in the 
inclusion bodies and therefore its solubilization and purification on the nickel 
beads was performed in 6 M guanidine-HCl. Oxidative refolding and factor 
15 Xa cleavage resulted in soluble forms (Figure 15C) and the in vitro biological 
activity was tested on HDC57 cells. The cleaved E. coli recombinant Epo 
showed a notable decrease of the specific activity (10% less than the fully 
glycosylated mammalian expressed protein), but was still able to maintain 
HCD57 proliferation. The observed reduction of in vitro activity is likely to 
20 be due to improper refolding of the insoluble protein and impaired physical 
stability of the E. coli Epo as previously reported (Narhi, L.O., et aL, 7. BioL 
Chem. 266:23022-23026 (1991)). However, the fact that the E. coli Epo was 
still able to trigger HCD57 growth indicated an overall preservation of the 
molecular structure. The uncleaved fusion protein, with 10 His residues at the 
25 N-termlnus. exhibited a 67% loss in biological activity when compared to the 
cleaved protein. Thus, the addition of a 20-residue sequence to the N- 
terminus partially inhibited the biological activity. 
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Discussion 

Currently the accrual rate of new protein sequences through gene 
cloning far outstrips the rate of determination of three dimensional structure, 
Epo is among a large number of biologically important proteins which have 

5 not yet been analyzed by X-ray diffraction or NMR. The problem is simplified 
by cumulative evidence that the structures of most proteins are likely to be* 
variations on existing themes (Uvitt, M., et al. Nature 261 :552-55i (1976)). 
Indeed, as mentioned above, Epo appears to share common structural features 
with a large group of cytokines (Parry, D.A.D., et ai, J. Mol 

10 Recognition 7:107-110 (1988); Bazan, F., Immunology Today iJ:350-354 
(1990); Manavalan, P., et ai, J. Protein Chem. 77:321-331 (1992)). 

Computer-based prediction of structure can be reduced to a three stage 
process: secondary structure is predicted from the primary iamino acid 
sequence and, when available, optical measurements. Analysis of Epo by 

15 circular dichroism reveals about 50% a helix and no detectable ^ sheet (Lai, 
P.-H.. etal., J. Biol Chem. 267:3116-3121 (1986); Davis, J.M„ etal. 
Biochemistry 26:2633-2638 (1987)). With the knowledge of disulfide bonds, 
secondary structural elements are then packed into a set of alternative tertiary 
structures. The number of plausible arrangements can be reduced by 

20 empirical knowledge of preferred helix-helix packing geometries and the need 
for globular strucuire to form a hydrophobic core. The putative tertiary 
structure is tiien refined by standard force field calculations. Since tiiere are 
a large number of alternate tertiary structures, tiie availability of 
experimentally determined structure of a homologous protein is critically 

25 important. Thus die predicted model of Epo structure gains considerable 
validity by knowledge of die structures of GH (Abdel-Meguid, S.S,, et al, 
Proc. Natl Acad. Sd. USA 84:6434-6437 (1987); deVos, A.M., etal, 
Science 255:306-312 (1992)) and IW (Redfield, C, etal, 
Biochemistry 30:11029-11045 (1991); Powers, R., et aL, 

30 Science 256: 1673-1677 (1992)). 
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We have tested the predicted four anti-parallel a-helical bundle 
structure by means of site-directed mutagenesis. Deletions within predicted 
a helices would be expected to destabilize tertiary structure, whereas deletions 
or insertions in non-helical segments should be permitted unless they impose 

5 undue strain on the structure. For example, a deletion in an overhand inter- 
helical loop may result in insufficient length to connect the two helices. 
Results that we have obtained on muteins produced in mammalian (Cos?) cells 
are summarized in Figure 16. Our measurements of the quantities of 
processed mutein by RI A may underestimate the true amount of secreted Epo. 

10 Even a small deletion or insertion can result in a conformational change that 
may lead to impaired binding by our polyclonal antibody, raised against native 
human Epo. Thus the values for specific activity (biologic activity/RIA) that 
we report must be regarded as approximations. This caveat notwithstanding, 
our mutagenesis results are in good agreement with our proposed four 

15 a-helical model of erythropoietin. The proper folding of Epo into its native 
tertiary structure is necessary for stability and biological function. Muteins 
with short deletions inside predicted a-helices were not processed and exhibited 
no biological activity. In contrast, when deletions were created in predicted 

4 

interconnecting loops, secreted proteins were detected, to varying degrees. 

20 bodi by radioimmunoassay and bioassay. Furthermore, additions or deletions 
at the N- or C-termini did not maricedly impair the secretion and the biological 
activity of tiie Epo protein. Moreover, mutations at Cys29 and Cys33 showed 
that tiic small disulfide loop is not critical for biological activity. In order to 
delineate Epo's functionally important residues involved in tiie direct binding 

25 onto tiie Epo receptor, we have prepared and tested a series of amino acid 
replacements on tiie surfaces of the predicted a helices. These experiments 
are described in a Example III. 
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EXAMPLE III 

Erythropoietin Structure-Function Relationships: 
Muteins That Test Functionally Important Domains 



Introduction 

5 The Epo receptor is a member of an ever enlarging family of cytokine • 

receptors (Bazan, F.. Biochem. Biophys, Res. Commun, 764:788-796 (1989)), 
In like manner, Epo shares weak sequence homology with other members of 
a family of cytokines which also include growth hormone, prolactin, IL-2, 
11^3, IL-4. IL-5, IL-6, IL-7, G-CSF, GM-CSF, M-CSF, oncostatin M, 

10 leukemia inhibitory factor and ciliary neurotrophic factor (Parry, D.A.D. 
etai, L Mol Recognition i:107-110 (1988); Bazan F., Immunology 
Today iJ:350-354 (1990); Manavalan. P. et alj. Protein Chem, 7J:321-331 
(1992)). The genes encoding these proteins have similar numbers of exons as 
well as a clear relationship between intron-exon boundaries and predicted a- 

15 helical bundles. Such theoretical models of the structtires of human growth 
hormone (Cohen, F.E. etaL, Proteins: Struct. Func. Genet. 2:162-166 
(1987)) and IL4 (Curtis, B.M. etai. Proteins: Struct. Func. Genet. 
Ji: 1 1 1-1 19 (1991)) have been in remarkably good agreement with subsequent 
structures established by X-ray diffraction (HGH) (Abdel-Meguid, S.S, et d. , 

20 Proc. Natl. Acad. Sci. USA W:6434.6437 (1987); de Vos. A.M. etal.. 
Science 255:306-312 (1992)) or by multidimensional NMR (lL-4) (Redfield, 
C. etaL, Biochemistry 50:11029-11035 (1991); Powers, R. etaL, 
Science 256: 1673-1677 (1992)). Moreover, the crystal structures of GM-CSF 
(Diederichs, K. et ai , Science 254: 1779-1782 (1991)) and monomeric M-CSF 

25 (Pandit, J. etal., Science 258:1358-1362 (1992)) are also in reasonable 
agreement with their predicted structures. 

The structure of Epo has not yet been analyzed by either X-ray 
diffraction or by NMR. In order to begin to gain an understanding of 
structure-function relationships, we have taken a three-pronged approach: 
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a) Sequence determination of Epo from mammals of different 
orders in order to establish regions of homology. This work is 
described in Example I. 

b) Construction of a model of the three-dimensional structure of 
5 Epo, followed by the design and preparation of muteins that test 

this model. These experiments are presented in Example II. 

c) Design and testing of muteins that provide information on 
receptor binding domain(s). This work is presented in this 
Example. 

10 MATERIALS AND METHODS 

Construction of the Mutagenic/Mammalian Expression Plasmid. A 
M13 plasmid, containing a 1.4 Kb EcoTXl-EcoRl human Epo cDNA insert 
(XHEpo FL12) was a gift from Genetics Institute (Cambridge, MA) (Jacobs, 
K. et fl/.. Nature 3/i:806-810 (1985)). A 943 bp EcoRUBglll fragment, 

15 corresponding to die complete coding sequence of die wild-type human 
erythropoietin, including untranslated regions 216 bp upstream and 183 bp 
downstream, was inserted into the mammalian expression plasmid pSG5 
(Stratagene ) (Sambrook, J. etcL, In "Molecular Cloning: A Laboratory 
manual", 2nd ed. Cold Spring Harbor, Laboratory, Cold Spring Harbor, 

20 N.Y., vol. 1, pp. 1.38-1.41 (1989)) and named pSG5-EpoAVT. 

Site-Directed Mutagenesis was carried out according to die protocol 
described by Kunkel, T.A. et aL, Meth. EnzymoL 154:367'3i2 (1987). 
Single-stranded DNA was rescued from die pSG5-^oAVT phagcmid grown 
overnight in£. coli CJ236, in 2XYT media containing M13K07 helper phage 

25 (In Vitrogen) and 70 /xg/ml kanamacyn (Sigma, St. Louis, Mo). The resulting 
uracil-containing ssDNA was used as a template for mutagenesis. 
Oligonucleotides (24 to 46 mer) were synthesized witii their 5' and 3' ends 
complementary to die target wild-type Epo sequence, A large variety of 
mutations (base substitutions, deletions and insertions) were created at the 
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centers of the mutagenic primer sequences. Annealing of the phosphorylaied 
primers (10:1 oligonucleotide/DNA template molecular ratio) was performed 
in 10 Ml of a 20 mM Tris-HCL pH 7.4, 2 mM MgClj, 50 mM NaCl solution. 
The reactions were incubated at 80'C for 5 minutes, then allowed to cool 
5 slowly to room temperature over a one hour period. The DN A polymerization 
was initiated by the addition as a mix of 1 /il of lOX synthesis buffer (100 mM 
Tris-HCl pH 7.4 50, mM MgCl,, 10 mM ATP, 5 mM each dNTPs. 20 mM 
DTT). 0.5 ^1 (8 units) of T4 DNA ligase and I /il (1 unit) of T4 DNA 
polymerase (Boehringer Mannheim). After 2 hours at 37«C, 80 n\ of IX TE 
10 was added. 5 n\ of the diluted reaction mix was used to transform competent 
E. coli NM522 (ung*, dut*). 

Since a 40-80% mutation yield is normally obtained, 4 to 5 double- 
stranded plasmid clones from each reaction were sequenced with 7-dea2a- 
dGTP and Sequenase (U.S. Biochemical) (Sanger, F. er a/. . Proc. Natl. Acad. 
15 Sci. USA 74:5463-5467 (1977)). As a rule, the entire coding sequences of the 
Epo mutants were examined for the presence of unwanted mutation by 
sequencing or restriction enzyme mapping. 

Production of Wild-Type and ^o Muteins in Mammalian Cells. 
Cos7 cells grown to ^prox. 70% confluency were transfected widi 10 ng 
20 recombinant plasmid DNA per 10 cm dish using the calcium phosphate 
precipitation protocol (Kingston. R.E. etal.. In "Current Protocols in 
Molecular Biology". Green Publ. Assoc. & Wiley Interscience. N.Y., vol. 1. 
pp. 9.1.1-9.1.9 (1991)). As a control of transfection efficiency, in several 
experiments 2 /ig of pCHllO plasmid (Pharmacia) was co-transfected and P- 
25 galactosidase activity measured in the cytoplasmic extracts. 

RNA Blot-Hybridization Analysis. Total RNAs were prepared from 
cultured Cos7 cells (Chirgwin, J.M. et iH.. Biochemistry 78:5294-5299 (1979)) 
and 2 /ig samples electrophoresed on a 1.1% agarose gel containing 2.2 M 
formaldehyde. Transfer to GeneScreen Plus filters (New England Nuclear) 
30 and hybridization with a "P-labeled wt Epo probe were carried out as 
previously described (Faquin. W.C. et al.. Blood 79:1987-1994 (1992)). 
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Quantitation of Transiently-Expressed Recombinant Epos, The 
amount of secreted protein in the supematants of transfected Cos? was 
determined by a radioimmune assay (RIA). The RIA was performed using a 
high titer rabbit polyclonal antiserum raised against the human wild-type Epo 

5 and produced in our laboratory. *"I-labeled recombinant Epo was obtained 
from Amersham. Details of the protocol have been published elsewhere 
(Faquin, W.C. et aL, Exp. HematoL 21: 42(M26 (1993)). 

Bioassays. The dose-dependem proliferation activities of wt and Epo 
muteins were assayed in vitro using three different target cells: murine spleen 

10 cells, following a modification of the method of Krystal (Krystal, G., Exp. 
HematoL i7:649-660 (1983); Goldberg, M.A. et at., Froc. Natl. Acad. Sci. 
USA 84:7972-7976 (1987)); murine Epo responsive murine cell line, 
developed by Hankins (Hankins. W.D. etai. Blood 7ft 173a (1987)); and 
human Epo-dependent UT-7/Epo cell line, derived from the bone marrow of 

15 a patient with acute megakaryoblastic leukemia (Komatsu, N et ai. Cancer 
Res. 57:341-348 (1991)). After 22 to 72 hours of incubation with increasing 
amounts of recombinant proteins, cellular growth was determined by pH]- 
thymidine (New England Nuclear) uptake or by the colorimetric MTT assay 
(Sigma) (Yoshimura, A. etai, Proc. natl. Acad. Sci. USA 87:4139-4143 

20 (1990)). 

RESULTS AND DISCUSSION 

Our objective is to identify the functionally important domains on the 
Epo molecule by preparing muteins that have altered q)ecific activity, i.e. 
significantly high or low biological activity per unit mass of protein. Such 
25 muteins must satisfy the following criteria: efficient secretion by Cos7 cells; 
full recognition (near normal binding affinity) by the polyclonal anti-Epo 
antiserum that we use in our radio-immune assay; preservation of the overall 
folding and three dimensional structure. In order to meet these criteria, the 
muteins that we employ in this study have only single amino acid 
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replacements. A comprehensive amino acid replacement scan of Epo would 
of necessity require the preparation and testing of a minimum of 166 muteins. 
Such an undertaking would be prohibitively labor intensive and costly. 

In order to devise a rational strategy to reduce the number of required 
5 replacements, we have taken advantage of information we have obtained 
employing scanning deletion muteins (See Example 11). On the basis of its 
primary amino acid sequence and disulfide bonds Epo. in common with other 
members of the cytokine family is predicted to have a four antiparallel 
amphipathic o-helical bundle structure. We have shown that deletions in non- 
10 helical regions at the N-terminus. the C-terminus and in loops between helices 
result in the formation of protein that is readily secreted from the cell and is 
biologically active. These regions can be ruled out as functionally important 
dom^ns such as the sites involved in die binding of Epo to its receptor. These 
results suggest tiiat die functionally importam contact sites are likely to reside 
15 on the predicted o helices. Furthermore, since die amphipathic helices are 
predicted to bind to one anotiier along their hydrophobic surfaces, die 
biologically relevant contact sites are likely to be residues predicted to be on 
die external surfaces of die helices. These considerations grcarty restrict die 
number of mutations needed to provide a comprehensive and informative body 
20 of data. 

• Because structure-function studies on growdi hormone and lL-4 have 
indicated diat Helices A and D are involved in receptor binding, most of our 
muteins have been directed to diese regions. 

Results on Helix A muteins are shown in Figure 17. The top panel 

25 shows a helical wheel projection. Replacement by alanine of die predicted 
external residues 13. 18. 20 and 21 have no significant impact on biological 
activity as determined by die relative incorporation of pH]-diymidine into Epo 
dependent HCD57 cells. However, die replacemem of ArgU by Ala showed 
markedly impaired biological activity. 

30 Figure 18, top panel, shows a helical wheel projection for Helix D. 

Nine of die nine predicted external residues were replaced by alanine. 
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Muiauons at positions 136, 139 and 140 gave rise to proteins that were 
efficiently secreted and had full biological activity (Figure 18, bottom). In 
contrast Ala replacement of either LysUO, Argl43. Serl46, Asnl47 or 
Lysl54 resulted in muteins with significantly (3-fold) increased biological 
5 activity. Replacement of Asnl47 with Ala resulted in the greatest increase in 
biological activity. (Figures 23-25). 

Because the C-terminal boundary of Helix D is uncertain, we tested 
alanine replacements at positions 152, 154 and 156 which are predicted to be 
in the adjacent non-helical segment at the C-terminus. As shown in Figure 19, 
10 tiie Ala replacement at Lysl54 had slightiy increased biological activity while 
tiie Ala replacemem atTyrl56 had slightiy decreased biological activity. 

The experiments reported above all involve testing tiie function of 
human Epo muteins on tiie mouse Epo responsive HCD57 cell line. Figure 20 
shows a comparative display of tiiose muteins tiiat appear to have higher 
15 biological activity tiian wild-type Epo. 

In order to assess tiie significance of tiiese results we have also tested 
diese muteins in a more physiologic system, mouse erytiiroid progenitors 
prepared from tiie spleens of animals challenged witii phenylhydrazine. This 
bioassay obviates tiie possible confounding effects of a malignant clonal cell 
20 line such as HCD57 which was derived from mouse erytiiroleukemia virus. 

• As shown in Figure 21 , results witii eryfliroid cells from mouse spleen 
were very similar to tiiose obtained witii HCD57 cells. 

In addition we have tested tiiese muteins in an Epo responsive human 
cell line UT-7/Epo, derived from a patient witii acute leukemia. This cell line 
25 provides die opportunity of examining tfie interaction of human muteins witii 
tiie human Epo receptor. As shown of ordinary in Figure 22, tiiese results 
closely parallel tiiose witii die two mouse cell bioassays. 

It should be noted diat in several of die dose response curves, at die 
highest concentrations of Epo tested, diere is a significant drop-off in die 
30 incorporation of pH]-tiiymidine. This is due to die fact tiiat high levels of 
biologically active Epo causes terminal maturation and cessation of division. 



wo 94/24160 



PCTAJS94/04361 



-70- 



Because of this possibly confounding phenomenon, it is important to have, as 
in the experiments that we report here, a full dose response curve for each 
mutein with an adequate number of data points. 

Taken together our results show that a restricted number of residues on 

5 Helix A and on Helix D are involved in Epo's biological activity. Further 
experiments are in progress to determine if the observed alterations in 
biological activity can be explained by parallel changes in the binding affinity 
to the Epo receptor. In addition, at the sites that we have identified as 
involved in Epo's biological activity, we plan to make a set of additional 

10 replacement muteins in order to further refine the ligand-receptor interaction. 
We are particulariy interested in developing muteins with maximally increased 
biological activity, e.g. by preparing compound or multiplex replacements, 
each of which confers optimal enhancement of biological activity. Such a 
"superEpo" could have improved utility in the therapy of anemias. 

J5 Conclusions 

We have produced a number of erythropoietin muteins comprising 
native erythropoietin with one or more modifications which improves the 

biological activity. 

• All references mentioned herein are incorporated by reference into this 

20 di^losure. 

Having now fiilly described the invention by way of illustration and 
example for purposes of clarity and understanding, it will be apparent to those 
of ordinary skill in the art tiiat certain changes and modifications may be 
practiced within Uie scope of the invention, as limited only by die following 

25 claims. 
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SEQOENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Brigham and Women's Hospital 
75 Francis Street 
Boston, MA 02115 

INVENTORS: Boissel, Jean-Paul R. 
Bunn. H. Franklin 
Wen, Danyi 
Showers, Mark O. 

(ii) TITLE OF INVENTION: Erythropoietin Muteins With Enhanced 
Activity 

(iii) NUMBER OF SEQUENCES: 59 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Sterne, Kessler, Goldstein & Fox 

(B) STREET: 1100 New York Avenue, Suite 600 

(C) CITY: Washington 

(D) STATE: D.C. 

(E) COUNTRY: U.S.A. 

(F) ZIP: 20005-3934 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
{B) COMPUTER: IBM, PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version »1. 2b 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: (to be assigned) 

(B) FILING DATE: Herewith 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/049,802 

(B) FILING DATE: 21-APR-1993 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

• (A) NAME: Cimbala, Michele A. 
<B) REGISTRATION NUMBER: 33,651 
(C) REFERENCE/DOCKET NUMBER: 0627.336PC01 

(ix)" TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (202) 371-2600 

(B) TELEFAX: (202) 371-2540 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDBDNESS : single 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
ATWGAATGAA SGC 

(2) INFORMATION FOR SEQ ID N0:2: 



13 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 
AWGGTNSGGG 

(2) INFORMATION FOR SEQ ID N0:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; both 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 
TGCTTCTGCT ATCTTTGCTG CTGC 
(2) INFORMATION FOR SEQ ID N0:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acide 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 
<B) LOCATION: 12.. 13 

(D) OTHER INFORMATION: /label= Peptide 

/note= "The amino acid at position 12 may also be 
isoleucine. The amino acid at position 13 may 
also be aspartic acid." 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

Asp Thr Lys Val Asn Phe Tyr Ala Trp Lys Arg Met Glu Val Gly 
15 10 15 



(2) INFORMATION FOR SEQ ID N0:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
TGAAGTTTGG CCGAGAAGTG GATGC 25 
(2) INFORMATION FOR SEQ ID N0:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS ! single 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6: 
AAGAKGTACC TCTCCAGRAC TCG 
(2) INFORMATION FOR SEQ ID N0:7: 



(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:7: 
CTGCTCCACT CCGAACAMTC AC 22 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CTGGAGTGTC CATGGGACAG 20 
(2) INFORMATION FOR SEQ ID NO:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:9: 
AGGCGCGGAG ATGGGGGTGC 20 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 286 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

TGGTAGCTGG GGGTGGGGTG TGCACACGGC AGCAGGATTG AATGAAGGCC AGGGAGGCAG 60 

CACCTGAGTG CTTGCATGGT TGGGGACAGG AAGGACGAGC TGGGGCAGAG ACGTGGGATG 120 

AAGGAAGCTG TCCTTCCACA GGCCACCCTT CTCCCTCCCC GCCT6ACTCT CAGCCTGGCT 180 

ATCTGTTCTA GAATGTCCTG CCTGGCTGTG GCTTCTCCTG TCCCTGCTGT CGCTCCCTCT 240 
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GGGCCTCCCA GTCCTGGGCG CCCCACCACG CCTCATCTGT GACAGC 
(2) INFORMATION FOR SEO ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 290 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:11: 

CTCGTAGTTG TTGGGTGGCG TGTGCACGCG GTCAGTGTAT TGAATGAA6G CAAAGGAGGC 60 

AGAGCCTGCG CGCTCGCAAG GTTGGGGTCG GAAACGGCTG GCTGGGGGCA AGACGGCCGG 120 

ATGGGTGAAC CTGTCCGTCT AAACCAACCC TTCTCCTGCA ATGCCCAGCC TCACATTCAG 180 

CCTGGCTCTC TTTCCTAGAA TGTCCTGCCC GGCTGCTTCT GCTATCTTTG CTGCTGCTTC 240 

CTCTGGGCCT CCCAGTCCTG GGCGCCCCCC CACGCCTCAT CTGTGACAGC 290 
<2) INFORMATION FOR SEQ ID N0xl2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 286 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: both 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GTAGCTCGGG GGGTGGGGTG TGCACGCGGC CGCGGGATTG AATGAAGGCA AAGGAGGCAG 60 

AGCCTGAGCG CTCGCAAGGT TCGGGTCGGG AGGACTCGCG GGGCCAGAGC AGCGGGATGG 120 

GTGAACGTGC AGGTCCAAAC CACCCCTTCT CCTGCAAGTC CCGGCCTCAC ACTCAGCCCA 180 

GCTCTCTCTC CTAGAATGTC TTCTGCTGCT TCTGCTGCCC TCCTTGCTGC TGCTTCCTCT 240 

GGGCCTCCCA GTCCTGGGCG CCCCCACACA CCTCATCTGT GACAGC 286 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 288 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CCGTAGCTGT TGGGGTGGGG TGCGCACGAA GCCGCGGGAT TGAATGAACG CGGAGGCAGA 
GCTGAGCGCT CCCAAGGTCC GGGTGGGGAG GGCTCACTGG GGGCTGCGCA GCGGGATGGG 
TGAACCTGCC GGTCCAAAGC AGCCCTTCTC CTGCGAGAAT GCACCTCATC CGCAGCCCGT 
CTCTCTTTCC TAGACTGTAC TCCGCTGCTG CTGCTGCTGC TGTCTCTTCT GCTGTTTCCT 
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CTGGGCCTCC CAGTCTTGGG CGCCCCCCCA CGCCTCATCT GTGACAGC -*^2B8 

(2) INFORMATION FOR SEQ ID N0:14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 280 base paire 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

TGGGGGTGGG GTGTGCAGGC GGCAGCAGGA TTGAATGAAG GCAGAGGA6A CAGACCCTGA 60 

GCGCTGGGAA GGTTGGGGGC AGGAGCCACT AGCTGGGGGC AGAGGAGGGG GATGCGTGAA 120 

CCTGCCCCTC CAAACCAACC CTCCCCTTCC CCTCCCTGCC TCACACTCAG CCCGGATCTC 180 

CTTTCCAGAA TGTCCTGCCC TGCTGCTTCT GCTGTCCCTG CTACTGCCTC CTCTGGGCCT 240 

CCCAGCCCTG GGCGCCCCTC CACGCCTCAT CTGTGACACC 280 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 274 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(xl) SEQUENCE DESCRIPTION: SEQ ID N0:15: 

CAGGTCAGTT TGATTGGGTG GATGTGTGCC GCGAAGCAGC GGCCGGATTG AATGAAGGCA €0 

GGGGAGGCAG AACCTGAACG CTGGGAAGGT GGGGGTCGGG CGCGACTAGT TGGGGGCAGA 120 

GGAGCGGGAT GTGTGAACCT GCCCCTCCAA ACCCACACAG TCAGCCTGGC ACTCTTTTCC 180 

AGAATGTCCT GCCCTGTTCC TTTTGCTGTC TTTGCTGCTG CTTCCTCTGG ACCTCCCAGT 240 

CCTGGGCGCC CCCCCTCGCC TCATTTGTGA CAGC 274 
(2) INFORMATION FOR SEQ ID NO: 16: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

TGGTAGTTTG GCGGGTGGTG TGTGCTCACC GCGGCGGCGG CGGGGATTGA ATGAAGGCCA 60 

AGGAGGCAGA ACCCGAGCGC TGGGAAGGTT CGGGGTGGGA GCGACAAGCT GGGGGCAGAG 120 

GAACGGGATG TGTGAACCTG CTCCTCCAAA CACACTCAGC CTGGCACTCT TTTCTAGAAT 180 

GTCCTGCCCT GCTGCTTCTG CTATCTTTGC TGCTGCTTCC CCTGGGCCTC CCAGTCCTGG 240 

GCGCCCCCCC TCGCCTCATC TGTGACAGC 269 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 268 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

GGTAGTTTGG CGGGTGGGGT GTGCTCACCG CGGCGGCGGG GGGGATTGAA TGAAGGCCAA 60 

GGAGGCAGAA CCCGAGCGCT GGGAAGGTTC GGGGTGGGAG CGACAAGCTG GGGGCAGAGG 120 

AACGGGATGT GTGAACCTGC TGCTCCAAAC ACACTCAGCC TGGCACTCTT TTCTAGAATG 180 

TCCTGCCCTG CTGCTTCTGC TATCTTTGCT GCTGCTTCCC CTGGGCCTCC CAGTCCTGGG 240 

CGCCCCCCCT CGCCTCATCT GTGACAGC 268 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 283 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(b) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:18: 

CGGTCGCTTG GGGGGTGGGG TGTGCAGCGC GGAGGGATTG AATGAAGGCC ACTCAGGCAG 60 

AGCCTAAGCA ATTGCAAGGT CGGGGTCAGC AGAGACTATA AGGGCAGAGG GGTCTCGCTG 120 

AGCCAACCGG CCCCTGCAGT CCCACGGCCC CCTCCCCTCT CGGCCTCACA CTCAGCCTGC 180 

CTTTCTTCCA GAACGTCCCA CCCTGCTGCT TTTACTCTCC TTGCTACTGA TTCCTCTGGG 240 

CCTCCCAGTC CTCTGTGCTC CCCCACGCCT CATCTGCGAC AGT 283 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i)' SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 263 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

<xi) SEQUENCE DESCRIPTION: SEQ ID N0:19: 
ATGGTGGCTT GGGGGGTGGT GTGTGCAGCG CCGGGAGATA GAATGAAGGC CACTCAGGCA 60 
GAGCCTAAGC AGTTGCAAGG TCGGGGTCAG CAGAGACTGG AAGGGCAGAG GAGCCTCGCT 120 
GTGCAAACCG GTCTCTGGTC GTCCCACGGC CCTCCCCTCC CAGCCTCACA CCCAGCCTGC 180 
CTTTCTTCTA GAACGTCCAA CCCTGCTGCT TTTGCTGTCT TTGCTCCTGC TTCCCCTGGG 240 
CCTCCCAGTC CTCTGCGCTC CCCCACGCCT CATCTGCGAC AGC 283 
(2) INFORMATION FOR SEQ ID NO: 20: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 112 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:20: 
GGGTGGTGGC GATGAATGAA GCAGCAGCTA GGTGGGGGGC GGTGAAGTAC AGCCGTCTCA 
GAGTGTCTTC TCTCTGCCCT GGCCTCCCAG CTGGCCCCCC CTCATTGGAC AG 
(2) INFORMATION FOR SEQ ID NO: 21: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 725 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:21: 

AGGCGCGGAG ATGGGGGTGC ACGAATGTCC TGCCTGGCTG TGGCTTCTCC TGTCCCTGCT 60 

GTCGCTCCCT CTGGGCCTCC CAGTCCTGGG CGCCCCACCA CGCCTCATCT GTGACAGCCG 120 

AGTCCTGGAG AGGTACCTCT TGGAGGCCAA GGAGGCCGAG AATATCACGA CGGGCTGTGC 180 

TGAACACTGC AGCTTGAATG AGAATATCAC TGTCCCAGAC ACCAAAGTTA ATTTCTATGC 240 

CTGGAAGAGG ATGGAGGTCG GGCAGCAGGC CGTAGAAGTC TGGCAGGGCC TGGCCCTGCT 3 00 

GTCGGAAGCT GTCCTGCGGG GCCAGGCCCT GTTGGTCAAC TCTTCCCAGC CGTGGGAGCC 360 

CCTGCAGCTG CATGTGGATA AAGCCGTCAG TGGCCTTCGC AGCCTCACCA CTCTGCTTCG 420 

GGCTCTGGGA GCCCAGAAAG AAGCCATCTC CCCTCCAGAT GCGGCCTCAG CTGCTCCACT 480 

CCGAACAATC ACTGCTGACA CTTTCCGCAA ACTCTTCCGA GTCTACTCCA ATTTCCTCCG 540 

GGGAAAGCTG AAGCTGTACA CAGGGGAGGC CTGCAGGACA GGGGACAGAT GACCAGGTGT 600 

GTCCACCTGG GCATATCCAC CACCTCCCTC ACCAACATTG CTTGTGCCAC ACCCTCCCCC 660 

GGCCACTCCT GAACCCCGTC GAGGGGCTCT CAGCTCAGCG CCAGCCTGTC CCATGGACAC 720 

TCCAG 725 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 681 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

ACGAATGTCC TGCCTGGCTG TGGCTTCTCC TGTCTCTCGT GTCGCTCCCT CTGGGCCTCC 60 

CAGTCCCGGG CGCCCCACCA CGCCTCGTCT GTGACAGCCG AGTCCTGGAG AGGTACCTCT 120 

TGGAGGCCAA GGAGGCCGAG AATGTCACGA TGGGCTGTTC CGAAAGCTGC AGCTTGAATG 180 
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AGAATATCAC CGTCCCACyVC ACCAAAGTTA ACTTCTATGC CTGGAAGAGG ATAGAGGTCG 240 

GGCAGCAGGC TGTAGAAGTC TGGCAGGGCC TGGCCCTGCT CTCAGAAGCT GTCCTGCGGG 300 

GCCAGGCCGT GTTGGCCAAC TCTTCCCAGC CTTTCGAGCC CCTGCAGCTG CACATGGATA 360 

AAGCCATCAG TGGCCTTCGC AGCATCACCA CTCTGCTTCG GGCGCTGGGA GCCCAGGAAG 420 

CCATCTCCCT CCCAGATGCG GCCTCGGCTG CTCCACTCCG AACCATCACT GCTGACACTT 480 

TCTGCAAACT CTTCCGAGTC TACTCCAATT TCCTCCGGGG AAAGCTGAAG CTGTACACGG 54 0 

GGGAGGCCTG CAGGAGAGGG GACAGATGAC CAGGTGCGTC- CAGCTGGGCA CATCCACCAA 600 

CTCCCTCACC AACACTGCCT GTGCCACACC CTCCCTCACC ACTCCCGAAC CCCATCGAGG 660 

GGCTCTCAGC TAAGCGCCAG C 681 

(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 679 base pairs 
<B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: both 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 



CCGAACGTCC 


CACCCTGCTG 


CTTTTACTCT CCTTGCTACT GATTCCTCTG GGCCTCCCAG 


60 


TCCTCTGTGC 


TCCCCCACGC 


CTCATCTGCG ACAGTCGAGT TCTGGAGAGG TACATCTTAG 


120 


AGGCCAAGGA 


GGCAGAAAAT 


GTCACGATGG GTTGTGCAGA AGGTCCCAGA CTGAGTGAAA 


180 


ATATTACAGT 


CCCAGATACC 


AAAGTCAACT TCTATGCTTG GAAAAGAATG GAGGTGGAAG 


240 


AACAGGCCAT 


AGAAGTTTGG 


CAAGGCCTGT CCCTGCTCTC AGAAGCCATC CTGCAGGCCC 


300 


AGGCCCTGCT 


AGCCAATTCC 


TCCCAGCCAC CAGAGACCCT TCAGCTTCAT ATAGACAAAG 


360 


CCATCAGTGG 


TCTACGTAGC 


CTCACTTCAC TGCTTCGGGT ACTGGGAGCT CAGAAGGAAT 


420 


TGATGTCGCC 


TCCAGATACC 


ACCCCACCTG CTCCACTCCG AACACTCACA GTGGATACTT 


480 


TCTGCAAGCT 


CTTCCGGGTC 


TACGCCAACT TCCTCCGGGG GAAACTGAAG CTGTACACGG 


540 


GAGAGGTCTG 


CAGGAGAGGG 


GACAGGTGAC ATGCTGCTGC CACCGTGGT6 GACCGACGAA 


600 


CTTGCTCCCC 


GTCACTGTGT 


CATGCCAACC CTCCACCACT CCCAACCCTC ATCAAAC6GG 


660 


TCATTACCTT 


CTTACCAGT 




679 



(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 678 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CCGAACGTCC CACCCTGCTG CTTTTACTAT CCTTGCTACT GATTCCTCTG GGCCTCCCAG 60 
TCCTCTGCGC TCCCCCACGC CTCATTTGCG ACAGTCGCGT TCTGGAGAGG TACATCTTGG 120 
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AGGCCAAGGA GGCAGAAAAT GTCACAATGG GCTGTGCAGA AGGTCCCAGA CTGAGTGAGA 180 

ATATTACCGT CCCAGATACC AAAGTCAACT TCTACGCTTG GAAAAGAATG AAGGTGGAAG 240 

AACAGGCTGT AGAAGTTTGG CAAGGCCTGT CTCTGCTCTC AGAAGCCATC CTGCAGGCCC 300 

AGGCTCTGCA GGCCAATTCC TCCCAGCCAC CAGAGAGTCT TCAGCTTCAT ATAGACAAAG 360 

CCATCAGTGG GCTACGTAGC CTCACTTCAC TGCTTCGGGT GCTGGGAGCT CAGAAGGAAT 420 

TGATGTCGCC TCCAGACGCC ACCCAAGCCG CTCCACTCCG AACACTCACA GCGGATACTT 480 

TCTGCAAGCT CTTCCGGGTC TACTCCAACT TCCTCCGGGG -GAAACTGAAG CTGTACACGG 540 

GGGAGGCCTG CAGGAGAGGG GACAGGTGAC CTGCCACTGC CGTGTACCCG CCAACTCGCT 600* 

CACCGTCACT GTGTCACGCC AACCCTCCAC CACTCCCAAC CCTCATCAAA CGGGGTTGTT 660 

TGTTACCTTC TTACCGGC 678 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 687 baee pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: both 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

GCGACTGTAC TCCGCTGCTG CTGCTGCTGC TGTCTCTTCT GCTGTTTCCT CTGGGCCTCC 60 

CAGTCTTGGG CGCCCCCCCA CGCCTCATCT GTGACAGCCG AGTCCTGGAG AGGTACATCC 120 

TGGAGGCCAG GGAGGCCGAA AATGCCACGA TGGGCTGTGC AGAAGGCTGC AGCTTCAGTG 180 

AGAATATCAC TGTCCCAGAC ACCAAGGTTA ACTTCTATGC CT6GAAGAGG ATGGAGGTCC 240 

AGCAGCAGGC TCTGGAAGTC TGGCAGGGCC TGGCTCTGCT CTCAGAAGCT ATCTTTCGGG 300 

GCCAGGCCCT ACCGGCCAAC GCATCCCAGC CATGCGAGGC CCTGCGGTTG CACGTGGACA 360 

AAGCTGTCAG CGGCCTCCGC AGTCTCACCT CCCTGCTTCG GGCGCTGGGA GCCCAGAAAG 420 

AAGCCATCCC' CCTTCCAGAT GCAACCCCCT CCGCAGCCCC ACTCCGAATA TTCACTGTTG 480 

ATGCTTTGTC CAAGCTCTTC CGAATCTACT CCAATTTCCT GAGGGGAAAG CTGACGCTGT 540 

ACACAGGGGA GGCCTGCAGG AGAGGGGACA GGTGACCCAG TGCTATCACC CCGGGCACGT 600 

CCATCACCTC GCTCACCATC ACTGCCTACG CCATGCCTTC CACGCCGCCA CTCCCAACCC 660 

CTGTCGACGA CGGGCCACCA GCTCAGT 687 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 688 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

<xi) SEQUENCE DESCRIPTION: SEQ ID N0:26: 
GAATGTCCTG CCCGGCTGCT TCTGCTATCT TTGCTGCTGC TTCCTCTGGG CCTCCCAGTC 60 
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CTGGGCGCCC CCCCACGCCT CATCTGTGAC AGCCGAGTCC TGGAGAGGTA CATCTTGGAG 120 

GCCAAGGAGG GCGAAAATGC CACGATGGGC TGTGCCGAAA GCTGCAGCTT CAGTGAGAAT 180 

ATCACTGTCC CAGACACCAA GGTTAACTTC TATGCCTGGA AGAGGATGGA GGTCCAGCAG 240 

CAGGCCATGG AGGTCTGGCA GGGTCTGGCC CTGCTCTCAG AAGCCATCCT GCAGGGCCAG 300 

GCCCTGTTGG CCAACTCCTC CCAGCCATCT GAGGCCCTGC AGCTGCATGT GGACAAAGCT 360 

GTCAGCGGCC TGCGCAGCCT CACCTCCCTG CTTCGGGCGC TGGGAGCCCA GAAGGAAGCC 420 

ATCCCCCTTC CAGACGCATC CCCCTCCTCT GCCACCCCAC TCCGAACATT TGCTGTTGAT 480 

ACTTTGTGCA AACTTTTCCG CAACTACTCC AATTTCCTGC GGGGAAAGCT GACGCTGTAC 540' 

ACTGGAGAGG CCTGCAGGAG AAGGGACAGG TGACTTGGTG CTCCCACCCG GGGCATGTCC 600 

ACCACTTCGC TCACCACCAC TGCCTGTGCC ATGCCTTCTG CACCTCCACT CCCAACCCCC 660 
GCTGAGGGGC CATCAGCTCA GCGCCAGC 
(2) INFORMATION FOR SEO ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 681 baee paire 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS i single 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

GAATGTCCTG CCCTGCTGCT TCTGCTATCT TTGCTGCTGC TTCCCCTGGG CCTCCCAGTC 60 

CTGGGCGCCC CCCCTCGCCT CATCTGTGAC AGCCGAGTCC TGGAGAGGTA CATTCTGGAG 120 

GCCAGGGAGG CCGAAAATGT CACGATGGGC TGTGCTGAAG GCTGCAGCTT CAGTGAGAAT 180 

ATCACTGTCC CAGACACCAA GGTCAACTTC TATACCTGGA AGAGGATGCA CGTCGGGCAG 240 

CAGGCTGTGG AAGTCTGGCA 6GGCCTCGCC CTGCTCTCAG AAGCCATCCT GCGGGGCCAG 300 

GCCCTGCTGG CCAACTCCTC CCAGCCATCT GAGACCCTGC AGCTGCATGT GGATAAAGCC 360 

GTCAGCAGCC TGCGCAGCCT CACCTCCCTG CTTCGGGCAC TGGGAGCCCA GAAGGAAGCC 420 

ACCTCCCTTC CAGAGGCAAC CtCTGCTGCT CCACTCCGAA CATTCACTGT CGATACTTTG 480 

TGCAAACTTT TCCGAATCTA CTCCAACTTC CTGCGGGGAA AGCTGACGCT GTACACAGGG 540 

GAGGCCTGCC GAAGAGGAGA CAGGTGACCA GGTGCTCCTA CCCCGGGCAT GTCCACCACC 600 

TCACTTACCA CCAATGCCTG TGCCACGCCC TCTGCACCAC CACTCCTGAC CCCTGTCGGG 660 
GGTGATCAGC TCAGCACCAG C 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acidG 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
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Met Gly Val His Glu Cys Pro Ala Trp Leu Trp Leu Leu Leu Ser Leu 
15 10 15 

Leu Ser Leu Pro Leu Gly Leu Pro Val Leu Gly Ala Pro Pro 
20 25 30 

(2) INFORMATION FOR SEQ ID NO:29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

Met Gly Val His Glu Cys Pro Ala Trp Leu Trp Leu Leu Leu Ser Leu 
I S 10 15 

Val Ser Leu Pro Leu Gly Leu Pro Val Pro Gly Ala Pro Pro 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30i 

Met Gly Val Pro Glu Arg Pro Thr Leu Leu Leu Leu Leu Ser Leu Leu 
15 10 15 

Leu He Pro Leu Gly Leu Pro Val Leu Cya Ala Pro Pro 
20 25 

(2) INFORMATION FOR SEQ ID NO: 31: 

* (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30_ amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi)- SEQUENCE DESCRIPTION: SEQ ID N0:31: 

Met Gly Val Arg Asp Cys Thr Pro Leu Leu Leu Leu Leu Leu Ser Leu 
X 5 10 15 

Leu Leu Phe Pro Leu Gly Leu Pro Val Leu Gly Ala Pro Pro 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 

Glu Cys Pro Ala Arg Leu Leu Leu Leu Ser Leu Leu Leu Leu Pro Leu 
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Gly Leu Pro Val Leu Gly Ala Pro Pro 
20 25 

(2) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

Glu eye Pro Ala Leu Leu Leu Leu Leu Ser Leu Leu Leu Pro Pro Leu 
15 10 

Gly Leu Pro Ala Leu Gly Ala Pro Pro 
20 25 

(2) INFORMATION FOR SEQ ID NO:34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:34: 

Glu Cys Pro Ala Leu Leu Leu Leu Leu Ser Leu Leu Leu Leu Pro Leu 
1 5 10 1^ 

Gly Leu Pro Val Leu Gly Ala Pro Pro 
20 25 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 
' (D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQIDNO:35: 

Glu Cve Pro Ala Leu Phe Leu Leu Leu Ser Leu Leu Leu Leu Pro Leu 
15 10 15 

Asp Leu Pro Val Leu Gly Ala Pro Pro 
20 25 

(2) INFORMATION FOR SEQ ID NO:36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 
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Glu Cys Leu Leu Leu Leu Leu Leu Pro Ser Leu Leu Leu Leu Pro Leu 
1 5 10 .15 

Gly Leu Pro Val Leu Gly Ala Pro Pro 
20 25 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 166 amino acids 
(6) TYPE: amino acid 
(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 

Ala Pro Pro Arg Leu He Cys Asp Ser Arg Val Leu Glu Arg Tyr Leu 
15 10 15 

Leu Glu Ala Lys Glu Ala Glu Asn He Thr Thr Gly Cys Ala Glu His 
20 25 30 

Cys Ser Leu Asn Glu Asn He Thr Val Pro Asp Thr Lys Val Asn Phe 
35 40 45 

Tyr Ala Trp Lys Arg Met Glu Val Gly Gin Gin Ala Val Glu Val Trp 
50 55 60 

Gin Gly Leu Ala Leu Leu Ser Glu Ala Val Leu Arg Gly Gin Ala Leu 
65 70 75 80 

Leu Val Asn Ser Ser Gin Pro Trp Glu Pro Leu Gin Leu His Val Asp 
85 90 95 

Lys Ala Val Ser Gly Leu Arg Ser Leu Thr Thr Leu Leu Arg Ala Leu 
100 105 110 

Gly Ala Gin Lys Glu Ala He Ser Pro Pro Asp Ala Ala Ser Ala Ala 
115 120 125 

Pro Leu Arg Thr He Thr Ala Asp Thr Phe Arg Lys Leu Phe Arg Val 
130 135 140 

Tyr Ser Asn Phe Leu Arg Gly Lys Leu Lys Leu Tyr Thr Gly Glu Ala 
145 150 155 160 

Cys Arg Thr Gly Asp Arg 
165 

(2) INFORMATION FOR SEQ ID NO:38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 165 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3B: 

Ala Pro Pro Arg Leu He Cys Asp Ser Arg Val Leu Glu Arg Tyr Leu 
15 10 15 

Leu Glu Ala Lys Glu Ala Glu Asn Val Thr Met Gly Cys Ser Glu Ser 
20 25 30 

Cys Ser Leu Asn Glu Asn He Thr Val Pro Asp Thr Lys Val Asn Phe 
35 40 45 
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Tyr Ala Trp Lys Arg Met Glu Val Gly Gin Gin Ala Val Glu Val Trp 
50 . 55 €0 

Gin Gly Leu Ala Leu Leu Ser Glu Ala Val Leu Arg Gly Gin Ala Val 
65 70 75 80 

Leu Ala Asn Ser Ser Gin Pro Phe Glu Pro Leu Gin Leu His Met Asp 
85 90 

Lye Ala He Ser Gly Leu Arg Ser He Thr Thr Leu Leu Arg Ala Leu 
100 105 110 

Gly Ala Gin Glu Ala He Ser Leu Pro Asp Ala Ala Ser Ala Ala Pro 
115 120 125 

Leu Arg Thr He Thr Ala Aep Thr Phe Cys Lya Leu Phe Arg Val Tyr 
130 135 140 

Ser Aen Phe Leu Arg Gly Lys Leu Lys Leu Tyr Thr Gly Glu Ala Cys 
145 150 155 ISO 

Arg Arg Gly Aep Arg 
165 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 165 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 

Ala Pro Pro Arg Leu Val Cys Asp Ser Arg Val Leu Glu Arg Tyr Leu 
15 10 15 

Leu Glu Ala Lye Glu Ala Glu Asn Val Thr Met Gly Cys Ser Glu Ser 
20 25 30 

eye Ser Leu Asn Glu Aen He Thr Val Pro Aep Thr Lye Val Aen Phe 
35 40 - 45 

Tyr Ala Trp Lys Arg He Glu Val Gly Gin Gin Ala Val Glu Val Trp 
50 55 €0 

Gin -Gly Leu Ala Leu Leu Ser Glu Ala Val Leu Arg Gly Gin Ala Val 
65 70 75 BO 

Leu Ala Asn Ser Ser Gin Pro. Phe Glu Pro Leu Gin Leu His Met Asp 
85 90 95 

Lve Ala He Ser Gly Leu Arg Ser He Thr Thr Leu Leu Arg Ala Leu 

105 110 



100 



Gly Ala Gin Glu Ala He Ser Leu Pro Aep Ala Ala Ser Ala Ala Pro 
lis 120 125 

Leu Arg Thr He Thr Ala Asp Thr Phe Cys Lye Leu Phe Arg Val Tyr 
130 135 140 

Ser Asn Phe Leu Arg Gly Lys Leu Lye Leu Tyr Thr Gly Glu Ala Cys 
145 150 155 150 

Arg Arg Gly Asp Arg 
165 
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(2) INFORMATION FOR SEQ ID N0:40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:40: 

Ala Pro Pro Arg Leu He Cye Aop Ser Arg Val Leu Glu Arg Tyr He 
1 5 10 15 

Leu Glu Ala Lye Glu Ala Glu Asn Val Thr Met Gly Cys Ala Glu Gly 
20 25 30 

Pro Arg Leu Ser Glu Asn He Thr Val Pro Asp Thr Lye Val Asn Phe 
35 40 45 

Tyr Ala Trp Lys Arg Met Glu Val Glu Glu Gin Ala He Glu Val Trp 
50 55 60 

Gin Gly Leu Ser Leu Leu Ser Glu Ala He Leu Gin Ala Gin Ala Leu 
65 70 75 80 

Leu Ala Asn Ser Ser Gin Pro Pro Glu Thr Leu Gin Leu His He Asp 
85 90 95 

Lys Ala He Ser Gly Leu Arg Ser Leu Thr Ser Leu Leu Arg Val Leu 
100 105 110 

Gly Ala Gin Lys Glu Leu Met Ser Pro Pro Asp Thr Thr Pro Pro Ala 
115 120 125 

Pro Leu Arg Thr Leu Thr Val Asp Thr Phe Cys Lye Leu Phe Arg Val 
130 135 140 

Tyr Ala Asn Phe Leu Arg Gly Lys Leu Lys Leu Tyr Thr Gly Glu Val 
145 150 155 160 

Cys Arg Arg Gly Asp Arg 
165 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 166 amino acids 
* (B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION; SEQ ID N0:41: 

Ala Pro Pro Arg Leu He Cys Asp Ser Arg Val Leu Glu Arg Tyr He 
15 10 15 

Leu Glu Ala Lys Glu Ala Glu Asn Val Thr Met Gly Cye Ala Glu Gly 
20 25 30 

Pro Arg Leu Ser Glu Asn He Thr Val Pro Asp Thr Lys Val Asn Phe 
35 40 45 

Tyr Ala Trp Lys Arg Met Lys Val Glu Glu Gin Ala Val Glu Val Trp 
50 55 60 

Gin Gly Leu Ser Leu Leu Ser Glu Ala He Leu Gin Ala Gin Ala Leu 
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es 70 75 80 

Gin Ala Asn Ser Ser Gin Pro Pro Glu Ser Leu Gin Leu His lie Asp 
85 ' 90 

Lye Ala He Ser Gly Leu Arg Ser Leu Thr Ser Leu Leu Arg Val Leu 
^ 100 105 110 

Gly Ala Gin Lys Glu Leu Met Ser Pro Pro Asp Ala Thr Gin Ala Ala 
115 120 12S 

Pro Leu Arg Thr Leu Thr Ala Asp Thr Phe Cye Lys Leu Phe Arg Val 
130 135 140 

Tyr Ser Asn Phe Leu Arg Gly Lys Leu Lys Leu Tyr Thr Gly Glu Ala 
145 ISO 155 "0 

Cys Arg Arg Gly Asp Arg 
165 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 167 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQIDNO:42: 

Ala Pro Pro Arg Leu He Cys Asp Ser Arg Val Leu Glu Arg Tyr He 
15 10 15 

Leu Glu Ala Arg Glu Ala Glu Asn Ala Thr Met Gly Cys Ala Glu Gly 
20 25 30 

Cys Ser Phe Ser Glu Asn He Thr Val Pro Asp Thr Lys Val Asn Phe 
35 40 45 

Tyr Ala Trp Lye Arg Met Glu Val Gin Gin Gin Ala Leu Glu Val Trp 
50 55 60 

Gin Gly Leu Ala Leu Leu Ser Glu Ala He Phe Arg Gly Gin Ala Leu 
65 70 75 eO 

Pro Ala Asn Ala Ser Gin Pro Cys Glu Ala Leu Arg Leu His Val Asp 
85 90 95 

Lvs Ala Val Ser Gly Leu Arg Ser Leu Thr Ser Leu Leu Arg Ala Leu 
^ 100 105 110 

Gly Ala Gin Lys Glu Ala He Pro Leu Pro Asp Ala Thr Pro Ser Ala 
115 120 125 

Ala Pro Leu Arg He Phe Thr Val Asp Ala Leu Ser Lys Leu Phe Arg 
130 135 140 

He Tyr Ser Asn Phe Leu Arg Gly Lys Leu Thr Leu Tyr Thr Gly Glu 
145 150 155 160 

Ala Cys Arg Arg Gly Asp Arg 
165 

(2) INFORMATION FOR SEQ ID NO:43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 168 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 

Ala Pro Pro Arg Leu lie Cys Asp Ser Arg Val Leu Glu Arg Tyr He 
15 10 15 

Leu Glu Ala Lys Glu Gly Glu Asn Ala Thr Met Gly Cye Ala Glu Ser 
20 25 30 

Cys Ser Phe Ser Glu Aon He Thr Val Pro Asp Thr Lys Val Asn Phe 
35 40 45 

Tyr Ala Trp Lys Arg Met Glu Val Gin Gin Gin Ala Met Glu Val Trp 
50 55 €0 

Gin Gly Leu Ala Leu Leu Ser Glu Ala He Leu Gin Gly Gin Ala Leu 
65 70 75 80 

Leu Ala Asn Ser Ser Gin Pro Ser Glu Ala Leu Gin Leu His Val Asp 
85 90 95 

Lys Ala Val Ser Gly Leu Arg Ser Leu Thr Ser Leu Leu Arg Ala Leu 
100 105 110 

Gly Ala Gin Lys Glu Ala He Pro Leu Pro Asp Ala Ser Pro Ser Ser 
115 120. 125 

Ala Thr Pro Leu Arg Thr Phe Ala Val Asp Thr Leu Cys Lys Leu Phe 
130 135 140 

Arg Asn Tyr Ser Asn Phe Leu Arg Gly Lys Leu Thr Leu Tyr Thr Gly 
145 150 155 160 

Glu Ala Cye Arg Arg Arg Asp Arg 
165 

(2) INFORMATION FOR SEQ ID NO:44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY I both 



(xi)' SEQUENCE DESCRIPTION: SEQ ID NO:44: 

Ala Pro Pro Arg Leu He Cys Asp Ser Arg Val Leu Glu Arg Tyr He 
15 10 15 

Leu Glu Ala Arg Glu Ala Glu Asn Val Thr Met Gly Cys Ala Glu Gly 
20 25 30 

Cys Ser Phe Ser Glu Asn He Thr Val Pro Asp Thr Lys Val Asn Phe 
35 40 45 

Tyr Thr Trp Lys Arg Met Asp Val Gly Gin Gin Ala Val Glu Val Trp 
50 55 60 

Gin Gly Leu Ala Leu Leu Ser Glu Ala He Leu Arg Gly Gin Ala Leu 
65 70 75 80 

Leu Ala Asn Ser Ser Gin Pro Ser Glu Thr Leu Gin Leu His Val Asp 
85 90 95 
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Lys Ala Val Ser Ser Leu Arg Ser Leu Thr Ser Leu Leu Arg Ala Leu 
100 105 110 

Gly Ala Gin Lys Glu Ala Thr Ser Leu Pro Glu Ala Thr Ser Ala Ala 
115 120 125 

Pro Leu Arg Thr Phe Thr Val Asp Thr Leu Cys Lys Leu Phe Arg He 
130 135 140 

Tyr Ser Asn Phe Leu Arg Gly Lye Leu Thr Leu Tyr Thr Gly Glu Ala 
145 150 155 160 

Cye Arg Arg Gly Asp Arg 
165 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 

Ala Pro Pro Arg Leu He Cys Asp Ser Arg Val Leu Glu Arg Tyr Leu 
IS 10 15 

Leu Glu Ala Lye Glu Ala Glu Asn He Thr Thr Gly Cys Ala Glu Hie 
20 25 30 

Cys Ser Leu Asn Glu Asn He Thr Val Pro Asp Thr Lys Val Aen Phe 
' 35 40 45 

Tyr Ala Trp Lys Arg Met Glu Gly Val Gin Gin Ala Val Glu Val Trp 
50 55 60 

Gin Gly Leu Ala Leu Leu Ser Glu Ala Val Leu Arg Gly Gin Ala Leu 
65 70 75 80 

Leu Val Aen Ser Ser Gin Pro Trp Glu Pro Leu Gin Leu Hie Val Asp 
85 90 95 

Lys Ala Val Ser Gly Leu Arg Ser Leu Thr Thr Leu Leu Arg Ala Leu 
100 105 110 

Gly Ala Gin Lys Glu Ala He Ser Pro Pro Asp Ala Ala Ser Ala Ala 
115 120 125 

. Pro Leu Arg Thr He Thr Ala Asp Thr Phe Arg Lye Leu Phe Arg Val 
130 135 140 

Tyr Ser Asn Phe Leu Arg Gly Lys Leu Lys Leu Tyr Thr Gly Glu Ala 
145 150 155 160 

Cys Arg Thr Gly Asp Arg 
165 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 

Ser Arg Val Leu Glu Arg Tyr Leu Leu Glu Xaa Xaa Ala Lye 
15 10 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acide 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Gin Ala Val Glu Val Trp Gin Gly Leu Ala Leu Leu Ser Glu Ala Val 
15 10 15 

Leu Arg 



(2) INFORMATION FOR SEQ ID NO: 48: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 

Pro Leu Gin Leu Hia Val Asp Lye Ala Val Ser Gly Leu Arg Ser Leu 
15 10 15 

Thr Thr 



(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 
' (B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQIDNO:49: 

Thr He Thr Ala Aep Thr Phe Arg Lys Leu Phe Arg "Val Tyr Ser Asn 
15 10 15 

Phe Leu Arg Gly Lys 
20 

(2) INFORMATION FOR SEQ ID NO:50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
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Glu Asn He Thr Thr Gly Cye Ala 
1 5 

lie Thr Val Pro Asp Thr Lys Val 
20 

Glu Val Gly Gin 
35 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



Glu Hie Cye Ser Leu Aen Glu Asn 
10 15 

Asn Phe Tyr Ala Trp Lys Arg Asn 
25 30 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:51: 

Leu Thr Thr Leu Leu Arg Ala Leu Gly Ala Gin Lys Glu Ala lie Ser 
15 10 15 

Pro Pro Asp Ala Ala Ser Ala Ala Pro Leu Arg 
20 25 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 

Leu Thr Thr Leu Leu Arg Ala Leu Gly Ala Gin Lys Leu He Ser Glu 
1 5 10 15 

Glu Asp Leu Glu Ala He Ser Pro Pro Asp Ala Ala Ser Ala Ala Pro 
20 25 30 

Leu Arg 



(2) INFORMATION FOR SEQ ID NO: 53: 

(iy SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: 

Cys Arg Thr Gly Asp Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO:54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 
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[xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Lye Asp Glu Leu 
1 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

His His His His His His 
1 5 

(2) INFORMATION FOR SEQ ID NO:56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acide 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:56: 

Met Gly His Ser Ser Gly Hie He Glu Gly Arg His Met Ala Pro Pro 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 57: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Val 'Leu Glu Arg Tyr Leu Leu Glu Ala Lys Glu Ala 
15 10 

(2) INFORMATION FOR SEQ ID NO:58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:58: 

Arg Thr He Thr Ala Asp Thr Phe Arg Lys Leu Phe Arg Val Tyr Ser 
15 10 15 

Asn Phe Leu Arg 
20 

(2) INFORMATION FOR SEQ ID NO:59: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:59: 

Gly Lye Leu Lys Leu Tyr Thr Gly Glu Ala Cys 
15 10 
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Claims: 



1 L An erythropoietin mutein comprising native erythropoietin with 

2 one or more of the following modifications: 

3 (a) replacement of the amino acid at position 20 with a first 

4 substitute amino acid; 

5 (b) replacement of the amino acid at position 49 with a 

6 second substitute amino acid; 

7 (c) replacement of the amino acid at position 73 witii a 

8 Uiird substitute amino acid; 

9 (d) replacement of Uie amino acid at position 140 witii a 

10 fourtii substitute amino acid; 

11 (e) replacement of tiie amino acid at position 143 witii a 

12 fifti) substitute amino acid; and 

13 (f) replacement of tiie amino acid at position 146 witii a 

14 sixdi substitute amino acid; 

15 (g) replacement of the amino acid at position 147 witii a 

16 seventii substitute amino acid; 

17 (h) replacement of die amino add at position 154 witii a 

1 8 cightii substitute amino acid; 

19 wherein said erytiiropoietin mutein exhibits enhanced biological activity 

20 compared to said native erytiirqwietin, 

1 2. The erytiiropoietin mutein of claim 1 comprising native 

2 erytiiropoietin witii one of said modifications. 

1 3. The erythropoietin mutein of claim 1 wherein said first, second, 

2 tiiird, fourtii, fifth, sixth, seventii or eightii substitute amino acids are selected 

3 from die group consisting of alanine, serine, tiireonine, and glycine. 
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1 4. The erythropoietin mutein of claim 3 wherein said first 

2 substitute amino acid is serine, and/or said second, third, fourth, fifth, sixth, 

3 seventh and/or eighth substitute amino acids are alanine. 

5 . The erythropoietin mutein of claim 2 wherein the amino acid at 
position 20 is replaced with the amino acid alanine. 

1 6, The erythropoietin mutein of claim 2 wherein the amino acid at 

2 position 49 is replaced with the amino acid serine. 

3 7. The erythropoietin mutein of claim 2 wherein the amino acid at 

4 position 73 is replaced with the amino acid glycine. 

5 8. The erythropoietin mutein of claim 2 wherein the amino acid at 

6 position 140 is replaced with the amino acid alanine. 

7 9. The erythropoietin mutein of claim 2 wherein the amino acid at 

8 position 143 is replaced with die amino acid alanine. 

9 10. The erythropoietin mutein of claim 2 wherein the amino acid at 

10 position 146 is r^laced with the amino acid alamne. 

11. The erythropoietin mutein of claim 2 wherein the amino acid at 
position 147 is replaced with the amino acid alamne. 

1 12. The erytiiropoietin mutein of claim 2 wherein the amino acid at 

2 position 154 is replaced witii the amino acid alamne. 

1 13, A DNA molecule encoding the erythropoietin mutein of 

2 claim 1. 
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1 14. A recombinant DNA construct comprising the DNA molecule 

2 of claim 13. 

1 15. A vector comprising the DNA construct of claim 14 which is 

2 capable of expressing said erythropoietin mutein in a host cell. 

1 16. A host cell comprising the DNA molecule of claim 13 capable 

2 of expressing the erythropoietin mutein encoded by said DNA molecule. 

1 17. The host cell of claim 16 wherein said host cell is capable of 

2 glycosylating said erythropoietin mutein. 

1 18. The host cell of claim 17 wherein said host cell is a mammalian 

2 cell. 

1 19. The host cell of claim 18 wherein said host cell is selected from 

2 the group consisting of Chinese Hamster Ovary cells, Cos7 cells,Cosl cells, 

3 baby hamster kidney cells, and CVl cells. 

1 20. A method of stimulating erythrocyte production in a subject, 

2 wherein said method comprises administering to said subject an efficacious 

3 amount of the erythropoietin mutein of claim 1. 

1 21. A method of stimulating erythrocyte production in a subject. 

2 said method comprising administering to said subject an efficacious amount of 

3 an erythropoietin mutein comprising native erythropoietin with the amino acid 

4 residue at position 143 replaced by an alanine residue, wherein said efficacious 

5 amount is less than the corresponding amount of native erythropoietin. 
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1 22. A method of inducing the proliferation of a cell culture 

2 responsive to erythropoietin, wherein said method comprises contacting said 

3 cell culture widi the erythropoietin mutein of claim 1. 

1 23. A method of inducing the proliferation of a cell culture 

2 responsive to erythropoietin, wherein said method comprises contacting said 

3 cell culture with an efficacious amount of an erythropoietin mutein comprising 

4 native erythropoietin with the amino acid residue at position 143 replaced by 

5 an alanine residue, wherein said efficacious amount is less than tiie 

6 corresponding amount of native erythropoietin. 

1 24. The metiiod of claim 18 or 19 wherein said cell culture 

2 comprises cells selected from the group consisting of murine spleen cells. 

3 murine erythroleukemia cells, and human UT-7/Epo cells. 

1 25. A metiiod for obtaining erytiiropoietin muteins with enhanced 

2 biological activity relative to native erytiiropoietin, said method comprising tiie 

3 steps: 

4 (a) preparing erythropoietin muteins comprising native erythropoietin 

5 witii a single amino acid substitution, said substitution occurring at a position 

6 selected from tiie group consisting of positions 48-52, 151-160, and positions 

7 predicted to reside on the external surface of helices A and D; 

8 (b) assaying the biological activity of the erytiiropoietin muteins from 

9 step (a) over a range of dosages; and 

10 (c) identifying those erythropoietin muteins witii enhanced biological 

1 1 activity relative to native erytiiropoietin. 
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